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Change or Add 


2-2 

Table 2-1: 

Line 

Function (Now) 

Function (Should Be) 


1 

XII 

XA11 


2 

X12 

XA12 


5 

X0D2 

XD2 


20 

IOA5 

XA5 


26 

IOD23 

XD23 


27 

IOD24 

XD24 


28 

IOD25 

XD25 


28 

VSUBS 

SUBS 


29 

IOD26 

XD26 


30 

IOD27 

XD27 


31 

IOD28 

XD28 


32 

IOD29 

XD29 


33 

IOD30 

XD30 


34 

IOD31 

XD31 


35 

lORDY 

XRDY 


2-6 Table 2-2. Insert the following at the end of the table. 



7-9 Line 7: src should be dst. 

A-5 Table A-5: Characteristics (13), (14), (15), (16), (17), and (18) change (10) to (X) 

in name and description. 

A-6 Figure A-4: Change (IO)R/W to (X)R/W, (IO)A to (X)A, (IO)D to (X)D, and 

(lO)RDY to (X)RDY. 

A-6 Table A-6: All characteristics change (10) to (X) in name and description. 

A-7 Figure A-5: Change (IO)R/W to (X)R/W, (IO)A to (X)A, (IO)D to (X)D, and 

(lO)RDY to (X)RDY. 

A-8 Figure A-6: Change lOR/W to XR/W, lOA to XA, lOD to XD, and lORDY to 

XRDY. Change (M)STRB in title to lOSTRB. 

A-9 Table A-7: Characteristics (22), (14.1), (15.1), (16.1), (17.1), and (18.1) change 10 

to X in name and description. 

A-9 Table A-8: All characteristics change 10 to X in name and description. 

The changes shown in this Manual Update will be included in the next revision of the Third-Generation 

TMS320 User's Guide. 
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Section 1 


Introduction 


The TMS320C30 (third-generation) Digital Signal Processor (DSP) is a 
high-performance CMOS 32-bit device in the TMS320 family of single-chip 
digital signal processors. Since 1982 when the TMS32010 was introduced, 
the TMS320 family has established Itself as the industry standard for digital 
signal processing. Powerful Instruction sets, high-speed number-crunching 
capabilities, and innovative architectures have made this high-performance 
family of processors ideal for DSP applications. 

The TMS320 family consists of three generations of processors: TMS320C1 x, 
TMS320C2X, and TMS320C3x (see Figure 1 -1). The family has expanded to 
include enhancements of earlier generations and more powerful new gener¬ 
ations of digital signal processors. 



Figure 1-1. TMS320 Device Evolution 
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Introduction 
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This document discusses the third-generation device, TMS320C30, within the 
TMS320 family. The 60-ns cycle time of the TMS320C30 allows It to execute 
operations at a performance rate previously available only on a supercomputer. 
Even higher performance is gained through Its large on-chip memories, con¬ 
current DMA controller, and Instruction cache. 

This section presents the following major topics: 

• General Description (Section 1.1 on page 1-3) 

• Key Features (Section 1.2 on page 1 -4) 

• Typical Applications (Section 1.3 on page 1 -5) 

• How To Use This Manual (Section 1.4 on page 1 -6) 

• References (Section 1.5 on page 1 -8) 
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Introduction - General Description 


1.1 General Description 

The TMS320's internal busing and special digital signal processing (DSP) 
instruction set provide speed and flexibility.This combination produces a pro¬ 
cessor family capable of executing up to 33 MFLOPS (million floating-point 
operations per second). The TMS320 family optimizes speed by Implementing 
functions in hardware that other processors implement through software or 
microcode. This hardware-intensive approach provides the design engineer 
with power previously unavailable on a single chip. 

The TMS320C30, the third-generation device In the TMS320 family, can 
perform parallel multiply and ALU operations on integer or floating-point data 
in a single cycle. The processor also possesses a general-purpose register file, 
program cache, dedicated auxiliary register arithmetic units (ARAU), internal 
dual-access memories, one DMA channel supporting concurrent I/O, and a 
short machine-cycle time. High performance and ease of use are achieved 
through greater parallelism, greater accuracy, and general-purpose features. 

General-purpose applications are greatly enhanced by the large address space, 
multiprocessor interface. Internally and externally generated wait states, two 
timers, two serial ports, and multiple Interrupt structure. The TMS320C30 
supports a wide variety of system applications from host processor to dedi¬ 
cated coprocessor. 

The emphasis on total system cost has resulted In a less-expensive processor 
that can be designed into systems currently using costly bit-slice processors. 
High-level language is more easily supported through a register-based archi¬ 
tecture, large address space, powerful addressing modes, flexible instruction 
set, and support of floating-point arithmetic. 
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1.2 Key Features 

Some key features of the TMS320C30 are listed below. 

• SO-ns single-cycle Instruction execution time 

- 33.3 MFLOPS (million floating-point operations per second) 

- 16.7 MIPS (million instructions per second) 

• One 4K x 32-bit single-cycle dual-access on-chip ROM block 

• Two 1K X 32-bit single-cycle dual-access on-chip RAM blocks 

• 64 X 32-bit Instruction cache 

• 32-blt instruction and data words, 24-bit addresses 

• 40/32-bit floating-point/integer multiplier and ALU 

• 32-bit barrel shifter 

• Eight extended-precision registers (accumulators) 

• Two address generators with eight auxiliary registers and two auxiliary 
register arithmetic units 

• On-chip Direct Memory Access (DMA) controller for concurrent I/O and 
CPU operation 

• Integer, floating-point, and logical operations 

• Two- and three-operand Instructions 

• Parallel ALU and multiplier instructions in a single cycle 

• Block repeat capability 

• Zero-overhead loops with single-cycle branches 

• Conditional calls and returns 

• Interlocked Instructions for multiprocessing support 

• Two serial ports to support 8/16/32-bit transfers 

• Two 32-bit timers 

• Two general-purpose external flags, four external interrupts 

• 180-pin grid array (PGA) package; 1 m m CMOS 
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1.3 Typical Applications 

The TMS320 family's unique versatility and realtime performance offer flexible 
design approaches in a variety of applications. In addition, TMS320 devices 
can simultaneously provide the multiple functions often required in those 
complex applications. Table 1 -1 lists typical TMS320 family applications. 

Table 1-1. Typical Applications of the TMS320 Family 


GENERAL-PURPOSE DSP 

GRAPHICS/IMAGING 

INSTRUMENTATION 

Digital Filtering 

Convolution 

Correlation 

Hilbert Transforms 

Fast Fourier Transforms 

Adaptive Filtering 

Windowing 

Waveform Generation 

3-D Rotation 

Robot Vision 

Image Transmission/ 

Compression 

Pattern Recognition 

Image Enhancement 

Homomorphic Processing 
Workstations 

Animation/Digital Map 

Spectrum Analysis 

Function Generation 

Pattern Matching 

Seismic Processing 

Transient Analysis 

Digital Filtering 

Phase-Locked Loops 

VOICE/SPEECH 

CONTROL 

MILITARY 

Voice Mail 

Speech Vocoding 

Speech Recognition 

Speaker Verification 

Speech Enhancement 

Speech Synthesis 

Text-to-Speech 

Neural Networks 

Disk Control 

Servo Control 

Robot Control 

Laser Printer Control 

Engine Control 

Motor Control 

Kalman Filtering 

Secure Communications 
Radar Processing 

Sonar Processing 

Image Processing 

Navigation 

Missile Guidance 

Radio Frequency Modems 
Sensor Fusion 

TELECOMMUNICATIONS 

AUTOMOTIVE 

Echo Cancellation FAX 

ADPCM Transcoders Cellular Telephones 

Digital PBXs Speaker Phones 

Line Repeaters Digital Speech 

Channel Multiplexing Interpolation (DSI) 

1200 to 19200-bps Modems X.25 Packet Switching 

Adaptive Equalizers Video Conferencing 

DTMF Encoding/Decoding Spread Spectrum 

Data Encryption Communications 

Engine Control 

Vibration Analysis 

Antiskid Brakes 

Adaptive Ride Control 

Global Positioning 
Navigation 

Voice Commands 

Digital Radio 

Cellular Telephones 

CONSUMER 

INDUSTRIAL 

MEDICAL 

Radar Detectors 

Power Tools 

Digital Audio/TV 

Music Synthesizer 

Toys and Games 

Solid-State Answering 

Machines 

Robotics 

Numeric Control 

Security Access 

Power Line Monitors 

Visual Inspection 

Lathe Control 

CAM 

Hearing Aids 

Patient Monitoring 
Ultrasound Equipment 
Diagnostic Tools 

Prosthetics 

Fetal Monitors 

MR Imaging 
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1.4 


How To Use This Manual 

The purpose of this user's guide is to serve as a reference book for the 
TMS320C30 digital signal processor. This document is designed to provide 
information that assists managers and hardware/software engineers in appli¬ 
cation development. The first group of sections provides specific information 
about the architecture and hardware operation of the device. Later sections 
describe the software operation. Specific software and hardware applications 
are provided in Sections 12 and 13, respectively. Electrical specifications and 
mechanical data can be found in the data sheet (Appendix A). 


The following table lists each section and briefly describes the section con¬ 
tents. 


Section 2. 

Section 3. 

Section 4. 

Section 5. 

Section 6. 

Section 7. 

Section 8. 

Section 9. 
Section 10, 


Pinout and Signal Descriptions. Drawing of the PGA 
package for the TMS320C30. Functional listing of the 
signals, their pin locations, and descriptions. 

Architectural Overview. Functional block diagram. 
TMS320C30 design description, hardware components, 
and device operation. Instruction set summary. 

CPU Registers. Memory, and Cache. Description of the 
registers In the CPU register file. Memory maps provided 
and Instruction cache architecture, algorithm, and control 
bits explained. 

Data Formats and Floating-Point Operations. Description 
of signed and unsigned integer and floating-point formats. 
Discussion of floating-point multiplication, addition, sub¬ 
traction, normalization, rounding, and conversions. 

Addressing. Operation, encoding, and Implementation of 
addressing modes. Format descriptions. System stack 
management. 

Program Flow Control. Software control of program flow 
with repeat modes and branching. Interlocked operations. 
Reset and interrupts. 

External Bus Operation. Description of primary and expan¬ 
sion interfaces. External interface timing diagrams. Pro¬ 
grammable wait-states and bank switching. 

Peripherals. Description of the DMA controller, timers, and 
serial ports. 

Pipeline Operation. Discussion of the pipelining of oper¬ 
ations on the TMS320C30. 


Section 11. Assembly Language Instructions. Functional listing of in¬ 
structions. Condition codes defined. Alphabetized indi¬ 
vidual instruction descriptions with examples. 

Section 12. Software Applications. Software application examples for 
the use of various TMS320C30 Instruction set features. 
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Section 13. 


Hardware Applications. Hardware design techniques and 
application examples for interfacing to memories, periph¬ 
erals, or other microcomputers/microprocessors. 


1 


Four appendices are Included to provide additional information. 


Appendix A. 
Appendix B. 

Appendix C. 


TMS320C30 Data Sheet. Electrical specifications, timing, 
and mechanical data. 

Development Support/Part Order Information. Listings of 
the hardware and software available to support the 
TMS320C30 device. 

Instruction Opcodes. List of the opcode fields for all the 
TMS320C30 instructions. 


Appendix D. Quality and Reliability. Discussion of Texas Instruments 
quality and reliability criteria for evaluating performance. 
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1.5 References 

The following reference list contains useful information regarding functions, 
operations, and applications of digital signal processing. These books also 
provide other references to many useful technical papers. The reference list is 
organized into categories of general DSP, speech, image processing, and di¬ 
gital control theory, and alphabetized by author. 
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Gold, Bernard and Rader, C.M., Digital Processing of Signals. New 
York, NY: McGraw-Hill Company, Inc., 1969. 

Hamming, R.W., Digital Filters. Englewood Cliffs, NJ: Prentice-Hall, 
Inc., 1977. 

IEEE ASSP DSP Committee (Editor), Programs for Digital Signal Pro¬ 
cessing. New York, NY: IEEE Press, 1979. 

Jackson, Leland Digital Filters and Signal Processing. Hingham, MA: 
Kluwer Academic Publishers, 1986. 

Jones, D.L. and Parks, T.W., A Digital Signal Processing Laboratory 
Using the TMS32010. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 

LIm, Jae and Oppenheim, Alan V. (Editors), Advanced Topics in Signal 
Processing. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1988. 

Morris, L. Robert, Digital Signal Processing Software. Ottawa, Canada: 
Carleton University, 1983. 

Oppenheim, Alan V. (Editor), Applications of Digital Signal Processing. 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 

Oppenheim, Alan V. and Schafer, R.W., Digital Signal Processing. En¬ 
glewood Cliffs, NJ: Prentice-Hall, Inc., 1975. 

Oppenheim, Alan V. and Willsky, A.N. with Young, I.T., Signals and 
Systems. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1983. 

Parks, T.W. and Burrus, C.S., Digital Filter Design. New York, NY: John 
Wiley and Sons, Inc., 1987. 

Rablner, Lawrence R., Gold and Bernard Theory and Application of 
Digital Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, Inc., 
1975. 
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Treichler, J.R., Johnson, Jr., C.R., and Larimore, M.G., Theory and De¬ 
sign of Adaptive Filters. New York, NY: John Wiley and Sons, Inc., 
1987. 

Speech: 

Gray, A.H. and Markel, J.D., Linear Prediction of Speech. New York, 
NY: Springer-Verlag, 1976. 

Jayant, N.S. and Noll, Peter, Digital Coding of Waveforms. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1984. 

Papamichalis, Panos, Practical Approaches to Speech Coding. Engle¬ 
wood Cliffs, NJ: Prentice-Hall, Inc., 1987. 

Rabiner, Lawrence R. and Schafer, R.W., Digital Processing of Speech 
Signals. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 

Image Processing: 

Andrews, H.C. and Hunt, B.R., Digital Image Restoration. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1977. 

Gonzales, Rafael C. and Wintz, Paul, Digital Image Processing. Reading, 
MA: Addison-Wesley Publishing Company, Inc., 1977. 

Pratt, William K., Digital Image Processing. New York, NY: John Wiley 
and Sons, 1978. 

Digital Control Theory: 

Jacquot, R., Modern Digital Control Systems. New York, NY: Marcel 
Dekker, Inc., 1981. 

Katz, P., Digital Control Using Microprocessors. Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1981. 

Kuo, B.C., Digital Control Systems. New York, NY: Holt, Reinholt and 
Winston, Inc., 1980. 

Moroney, P., Issues in the Implementation of Digital Feedback Com¬ 
pensators. Cambridge, MA: The MIT Press, 1983. 

Phillips, C. and Nagle, H., Digital Control System Analysis and Design. 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1984. 
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Section 2 


Pinout and Signal Descriptions 


The TMS320C30 (third-generation TMS320) digital signal processor is 
available in a 180-pin grid array (PGA) package. The pinout of this package 
(Figure 2-1), and a functional listing of the signals, pin locations, and de¬ 
scriptions are provided in this section. Electrical specifications and mechanical 
data are given in the data sheet (Appendix A). 
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Figure 2-1. TMS320C30 Pin Assignments 



























Pinout and Signal Descriptions 


Table 2-1. TMS320C30 Pin Function Assignments 



2 


NOTE: 

1) ADVDD, DDVDD, lODVDD, MDVDD, and PDVDD pins (D4, D12, E8, H5, H11, L8, M4, and M12) 
are on a common plane internal to the device. 

2) VDD pins (D8, H4, H12, and M8) are on a common plane internal to the device. 

3) VSS, CVSS, and INSS pins (B2, B14, C8, H3, H13, N8, and P14) are on a common plane internal 
to the device. 

4) DVSS pins (C3, Cl 3, N3, and N13) are on a common plane Internal to the device. 
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Pinout and Signal Descriptions 


2.1 Signal Descriptions 

The signal descriptions for the TMS320C30 device in the microprocessor 
mode are provided in this section. Table 2-2 lists each signal, the number of ■ 
pins, function, and operating mode(s), i.e., input, output, or high-impedance I 
state as indicated by I, 0, or Z. Ail pins labelled 'NC' are not to be connected | 
by the user. A line over a signal name (e.g., RESET) Indicates that the signal 
is active low true at a logic '0' level. The signals in Table 2-2 are grouped 
according to function. 

Table 2-2. TMS320C30 Signal Descriptions 


SIGNAL 

# PINS 

l/o/zt 

DESCRIPTION 

1 PRIMARY BUS INTERFACE (61 PINS) | 

D(31-0) 

32 

l/O/Z 

32-bit data port of the primary bus Interface. 

A (23-0) 

24 

0/Z 

24-bit address port of the primary bus interface. 

R/W 

1 

0/Z 

Read/write signal for primary bus interface. This pin is high 
when a read is performed and low when a write is performed 
over the parallel interface. 

STRB 

1 

0/Z 

External access strobe for the primary bus interface. 

1157 

1 

1 

Ready signal. This pin indicates that the external device is 
prepared for a primary bus interface transaction to complete. 
As long as RDY is a logic high, the data and address buses 
of the primary bus Interface remain valid. 

HOLD 

1 

1 

Hold signal for primary bus interface. When HdLD is a logic 
low, anv ongoing transaction is completed. The A(23-0), 
D(31-0), STRB,, and R/W signals are placed In a high-lm- 
pedance state, and ail transactions over the primary bus in- 
terface are held until HOLD becomes a logic high. 

HOLDA 

1 

0 

Hold acknowledge signal for primary bus interface. This 
signal is generated in response to a logic low_on HOLD. It 
signals that A(23-0), D(31-0), STRB, and R/W are placed 
in a high-impedance state and all transactions over the bus 
will be held. HOLDA will be high in response to a logic 
high of HOLD, 

EXPANSION BUS INTERFACE (49 PINS) 

XD (31-0) 

32 

l/O/Z 

32-bit data port of the expansion bus Interface. 

XA (12-0) 

13 

0/Z 

13-bit address port of the expansion bus interface. 

XR/TO 

1 

0/Z 

Read/write signal for expansion bus interface. When a read 

Is performed, this pin Is held high; when a write is per¬ 
formed, this pin is low. 

MSTRB 

1 

0/Z 

External memory access strobe for the expansion bus inter¬ 
face. 

rOSTRB 

1 

0/Z 

Externa! I/O access strobe for the expansion bus interface. 

XRDY 

1 

1 

Ready signal. This pin indicates that the external device is 
prepared for an expansion bus Interface transaction to 
complete. As long as XRDY is high, the data and address 
buses of the expansion bus interface remain valid. 


f Input, Output, High-Impedance state. 
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Table 2-2. TMS320C30 Signal Descriptions (Continued) 


SIGNAL # PINS l/O/Zt DESCRIPTION 

CONTROL SIGNALS (9 PINS) 

RESET 

1 

1 

Reset. When this pin is a logic low, the device is placed in 
the reset condition. When reset becomes a logic high, exe¬ 
cution begins from the location specified by the reset vector. 

llTf(3-0) 

4 

1 

External interrupts. 


1 

0 

Interrupt acknowledge signal. lACK goes low during exe¬ 
cution of an lACK instruction. This can be used to indicate 
the beginning or end of an interrupt service routine. 

mc/MP 

1 

1 

Microcomputer/microprocessor mode pin. 

XF(I-O) 

2 

I/O 

External flag pins. These pins are formatted as I/O through 
a program instruction, and latched internally when used as 
output pins. They are used as general-purpose I/O pins or 
to support interlocked processor instructions. 

1 SERIAL PORTO SIGNALS (6 PINS) | 

CLKXO 

1 

I/O 

Serial port 0 transmit clock. This pin serves as the serial shift 
clock for the serial port 0 transmitter. 

DXO 

1 

0/Z 

Data transmit output. Serial port 0 transmits serial data on 
this pin. 

FSXO 

1 

I/O 

Frame synchronization pulse for transmit The FSXO pulse 
Initiates the transmit data process over pin DXO. 

CLKRO 

1 

I/O 

Serial port 0 receive clock.This pin serves as the serial shift 
clock for the serial port 0 receiver. 

DRO 

1 

1 

Data receive. Serial port 0 receives serial data via the DRO 
pin. 

FSRO 

1 

1 

Frame sychronization pulse for receive. The FSRO pulse ini¬ 
tiates the receive data process over DRO. 

1 SERIAL PORT 1 SIGNALS (6 PINS) | 

CLKX1 

1 

I/O 

Serial port 1 transmit clock. This pin serves as the serial shift 
clock for the serial port 1 transmitter. 

DX1 

1 

0/Z 

Data transmit output. Serial port 1 transmits serial data on 
this pin. 

FSX1 

1 

I/O 

Frame synchronization pulse for transmit. The FSX1 pulse 
initiates the transmit data process over pin DX1. 

CLKR1 

1 

I/O 

Serial port 1 receive clock. This pin serves as the serial shift 
clock for the serial port 1 receiver. 

DR1 

1 

1 

Data receive. Serial port 1 receives serial data via the DR1 
pin. 

FSR1 

1 

1 

Frame sychronization pulse for receive. The FSR1 pulse ini¬ 
tiates the receive data process over DR1. 


t Input, Output, High-impedance state. 
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Table 2>2. TMS320C30 Signal Descriptions (Continued) 


SIGNAL # PINS l/O/Zt DESCRIPTION 

TIMER 0 SIGNALS (1 PIN) 

TCLKO 

1 

I/O 

Timer clock. As an input, TCLKO is used by timer 0 to count 
external pulses. As an output pin, TCLKO outputs pulses 
generated by timer 0. 

1 TIMER 1 SIGNALS (1 PIN) | 

TCLK1 

1 

I/O 

Timer clock. As an input, TCLK1 is used by timer 1 to count 
external pulses. As an output pin, TCLK1 outputs pulses 
generated by timer 1. 

SUPPLY AND OSCILLATOR SIGNALS (29 PINS) 

Vdd(3-0) 

4 

I 

Four +5 V supply pins. 

IODVdd(I.O) 

2 

I 

Two +5 V supply pins. 

ADVdd(I.O) 

2 

I 

Two +5 V supply pins. 

PDVqd 

1 

I 

One +5 V supply pin. 

DDVdd(I.O) 

2 

I 

Two +5 V supply pins. 

MDVdd 

1 

I 

One +5 V supply pin. 

VssO-O) 

4 

I 

Four ground pins. 

DVss(3-0) 

4 

I 

Four ground pins. 

CVssd.O) 

2 

I 

Two ground pins. 

iVss 

1 

I 

One ground pin. 

VbBP 

1 

NC 

Vbb pump oscillator output. 

SUBS 

1 

I 

Substrate pin. Tie to ground. 

XI 

1 

0 

Output pin from the internal oscillator for the crystal. If a 
crystal Is not used, this pin should be left unconnected. 

X2/CLKIN 

1 

I 

Input pin to the internal oscillator from the crystal or a clock. 

HI 

1 

0 

External HI clock. This clock has a period equal to twice 
CLKIN. 

H3 

1 

0 

External H3 clock. This clock has a period equal to twice 
CLKIN. 


t Input, Output, High-impedance state. 
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Table 2-2. TMS320C30 Signal Descriptions (Concluded) 


{ SIGNAL 

# PINS 

l/O/Zt 

DESCRIPTION 

RESERVED (18 PINS) 

EMU(0-2) 

3 

1 

Reserved. Use pull-ups to +5 volts. See Section 13.5 

EMU3 

1 

0 

Reserved. See Section 13.5 

EMU4 

1 

1 

Reserved. Tie to +5 volts. 


2 

NC 

Reserved. 

RSV(O-IO) 

11 

1 

Reserved. Tie to +5 volts. 


t Input, Output, High-impedance state. 

The user must follow the connections specified for the reserved pins. All pull-up resistors must be 20 k 
ohms. All +5 volt supply pins must be connected to a common supply plane and all ground pins must 
be connected to a common ground plane. 
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Section 3 


Architectural Overview 


Emphasis on hardware and software system solutions to demanding arithmetic 
algorithms has resulted in the TMS320C30 architecture shown in Figure 3-1. 
High system performance is achieved through the accuracy and precision of 
the floating-point units, large on-chip memory, a high degree of parallelism, 
and the DMA controller. 

This section provides an architectural overview of the TMS320C30 processor. 
Major areas of discussion are listed below. 

• Central Processing Unit (CPU) (Section 3.1 on page 3-3) 

- Floating-point/integer multiplier 

“ ALU for floating-point, integer, and logical operations 

- Auxiliary register arithmetic units (ARAUs) 

- CPU register file 

• Memory Organization (Section 3.2 on page 3-7) 

“ RAM, ROM, and cache 

- Memory maps 

“ Memory addressing modes 

- Instruction set summary 

• Internal Bus Operation (Section 3.3 on page 3-18) 

• External Bus Operation (Section 3.4 on page 3-19) 

• Peripherals (Section 3.5 on page 3-20) 

- Timers 

- Serial ports 

• Direct Memory Access (DMA) (Section 3.6 on page 3-21) 
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Architectural Overview 



Figure 3-1. TMS320C30 Block Diagram 
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Architectural Overview - Central Processing Unit (CPU) 


3.1 Central Processing Unit (CPU) 

The TMS320C30 has a register-based CPU architecture. The CPU consists 
of the following components: 

• Floating-point/integer multiplier 

• ALU for performing arithmetic (floating-point, integer)and logical oper¬ 
ations 

• 32-bit barrel shifter 

• Internal buses (CPU1 /CPU2 and REG1 /REG2) 

• Auxiliary register arithmetic units (ARAUs) 

• CPU register file. 

Figure 3-2 shows the various CPU components that are discussed In the 
succeeding subsections. 
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Figure 3-2. Central Processing Unit (CPU) 
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3.1.1 Multiplier 

The multiplier performs single-cycle multiplications on 24-blt integer and 
32-blt floating-point values. The TMS320C30 Implementation of floating¬ 
point arithmetic allows for floating-point operations at fixed-point speeds via 
a 60-ns Instruction cycle and a high degree of parallelism. To gain even higher 
throughput a multiply and ALU operation can be performed in a single cycle 
by using parallel instructions. 

When performing floating-point multiplication, the Inputs are 32-bit float¬ 
ing-point numbers, and the result is a 40-blt floating-point number. When 
performing Integer multiplication, the input data Is 24 bits and yields a 32-bit 
result. Refer to Section 5 for detailed information on data formats and float¬ 
ing-point operation. 

3.1.2 Arithmetic Logic Unit (ALU) 

The ALU performs single-cycle operations on 32-bit Integer, 32-bit logical, 
and 40-bit floating-point data, including single-cycle Integer and floating¬ 
point conversions. Results of the ALU are always maintained in 32-bit Integer 
or 40-bit floating-point formats. The barrel shifter Is used to shift up to 32 
bits left or right in a single cycle. 

Internal buses, CPU1/CPU2 and REG1/REG2, carry two operands from me¬ 
mory and two operands from the register file, thus allowing parallel multiplies 
and adds/subtracts on four integer or floating-point operands in a single cycle. 

3.1.3 Auxiliary Register Arithmetic Units (ARAUs) 

Two auxiliary register arithmetic units (ARAUO and ARAU1) can generate two 
addresses In a single cycle. The ARAUs operate in parallel with the multiplier 
and ALU. They support addressing with displacements. Index registers (IRO 
and IR1), and circular and bit-reversed addressing. Refer to Section 6 for a 
description of addressing modes. 

3.1.4 CPU Register File 

The TMS320C30 provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. All of these registers can be operated upon by the mul¬ 
tiplier and ALU, and can be used as general-purpose registers. However, the 
registers also have some special functions for which they are more suited than 
others. For example, the eight extended-precision registers are especially 
suited for maintaining extended-precision floating-point results. The eight 
auxiliary registers support a variety of indirect addressing modes and can be 
used as general-purpose 32-bit Integer and logical registers. The remaining 
registers provide system functions such as addressing, stack management, 
processor status, interrupts, and block repeat. Refer to Section 6 for detailed 
Information and examples of stack management and register usage. 

The registers names and assigned functions are listed In Table 3-1. Following 
the table, the function of each register or group of registers will be briefly de¬ 
scribed. Refer to Section 4 for detailed Information on each of the CPU reg¬ 
isters. 




Architectural Overview - Central Processing Unit (CPU) 


Table 3-1. CPU Registers 


REGISTER 

ASSIGNED FUNCTION 

NAME 


RO 

Extended-precision register 0 

R1 

Extended-precision register 1 

R2 

Extended-precision register 2 

R3 

Extended-precision register 3 

R4 

Extended-precision register 4 

R5 

Extended-precision register 5 

R6 

Extended-precision register 6 

R7 

Extended-precision register 7 

ARO 

Auxiliary register 0 

AR1 

Auxiliary register 1 

AR2 

Auxiliary register 2 

AR3 

Auxiliary register 3 

AR4 

Auxiliary register 4 

ARB 

Auxiliary register 5 

AR6 

Auxiliary register 6 

AR7 

Auxiliary register 7 

DP 

Data page pointer 

IRO 

Index register 0 

IR1 

Index register 1 

BK 

Block size 

SP 

System stack pointer 

ST 

Status register 

IE 

CPU/DMA interrupt enable 

IF 

CPU Interrupt flags 

lOF 

I/O flags 

RS 

Repeat start address 

RE 

Repeat end address 

RC 

Repeat counter 

PC 

Program Counter 


The extended-precision registers (R0-R7) are capable of storing and 
supporting operations on 32-bit integer and 40-bit floating-point numbers. 
Any instruction that assumes the operands are floating-point numbers uses 
bits 39-0. If the operands are either signed or unsigned integers, only bits 
31 -0 are used, bits 39-32 remain unchanged. This is true for all shift oper¬ 
ations. Refer to Section 4 for extended-precision register formats for float¬ 
ing-point and Integer numbers. 

The 32-blt auxiliary registers (AR0-AR7) can be accessed by the CPU and 
modified by the two Auxiliary Register Arithmetic Units (ARAUs). The primary 
function of the auxiliary registers is the generation of 24-bit addresses. They 
can also be used to perform a variety of functions, such as loop counters or 
as 32-blt general-purpose registers that can be modified by the multiplier and 
ALU. Refer to Section 6 for detailed information and examples of the use of 
auxiliary registers In addressing. 

The data page pointer (DP) Is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64 k words long with a total of 256 
pages. 
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The 32-bit index registers (IRO and IR1) are used by the Auxiliary Register 
Arithmetic Unit (ARAU) for indexing the address. Refer to Section 6 for ex¬ 
amples of the use of index registers in addressing. 

The 32-bit block size register (BK) is used by the ARAU in circular ad¬ 
dressing to specify the data block size. 

The system stack pointer (SP) is a 32-bit register that contains the address 
of the top of the system stack. The SP always points to the last element 
pushed onto the stack. A push performs a preincrement and a pop, a postde¬ 
crement of the system stack pointer. The SP is manipulated by Interrupts, 
traps, calls, returns, and the PUSH and POP Instructions. Refer to Section 6.5 
for information about system stack management. 

The status register (ST) contains global information relating to the state 
of the CPU. Typically, operations set the condition flags of the status register 
according to whether the result Is zero, negative, etc. This Includes register 
load and store operations as well as arithmetic and logical functions. When 
the status register Is loaded, however, a bIt-for-bIt replacement is performed 
on the current contents with the contents of the source operand regardless of 
the state of any bits In the source operand. Therefore, following a load, the 
contents of the status register are identically equal to the contents of the 
source operand. This allows the status register to be easily saved and restored. 
See Table 4.2 for a list and definitions of the status register bits. 

The CPU/DMA interrupt enable register (IE) is a 32-bit register. The 
CPU interrupt enable bits are In locations 10-0. The DMA Interrupt enable 
bits are In locations 26-16. A 1 in a CPU/DMA interrupt enable register bit 
enables the corresponding interrupt. A 0 disables the corresponding interrupt. 
Refer to Section 4.1 for bit definitions. 

The CPU interrupt flag register (IF) Is also a 32-bit register (see Section 
4.1). A 1 in a CPU interrupt flag register bit Indicates that the corresponding 
interrupt is set. A 0 indicates that the corresponding interrupt is not set. 

The I/O flags register (lOF) controls the function of the dedicated external 
pins, XFO and XFl. These pins may be configured for input or output, and 
they may also be read from and written to. See Section 4.1 for detailed infor¬ 
mation. 

The repeat counter (RC) Is a 32-blt register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. When 
operating In the repeat mode, the 32-bit repeat start address register 
(RS) contains the starting address of the block of program memory to be re¬ 
peated and the 32-bit repeat end address register (RE) contains the 
ending address of the block to be repeated. 

The program counter (PC) is a 32-blt register containing the address of the 
next instruction to be fetched. Although the PC is not part of the CPU register 
file. It Is a register that can be modified by Instructions that modify the program 
flow. 
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3.2 Memory Organization 

The total memory space of the TMS320C30 is 16M (million) 32-bit words. 
Program, data, and I/O space are contained within this 16M-word address 
space, thus allowing tables, coefficients, program code, or data to be stored 
in either RAM or ROM. In this way, memory usage can be maximized and 
memory space allocated as desired. 

3.2.1 RAM, ROM, and Cache 

Figure 3-3 shows how the memory is organized on the TMS320C30. RAM 
blocks 0 and 1 are each 1 K x 32 bits. The ROM block Is 4K x 32 bits. Each 
RAM and ROM block Is capable of supporting two accesses In a single cycle. 
The separate program buses, data buses, and DMA buses allow for parallel 
program fetches, data reads and writes, and DMA operations. For example: 
the CPU can access two data values In one RAM block and perform an ex¬ 
ternal program fetch in parallel with the DMA loading another RAM block, all 
within a single cycle. 

A 64 X 32-blt Instruction cache is provided to store often repeated sections 
of code, thus greatly reducing the number of off-chip accesses necessary. This 
allows for code to be stored off-chip In slower, lower-cost memories. The 
external buses are also freed for use by the DMA, external memory fetches, or 
other devices In the system. 

Refer to Section 4 for detailed Information about the memory and instruction 
cache. 
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Figure 3-3. Memory Organization 
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3.2.2 Memory Maps 


3 


The memory map is dependent upon whether the processor is running in the 
microprocessor mode (MC/MP = 0) or the microcomputer mode (MC/MP = 
1). The memory maps for these modes are very similar (see Figure 3-4). Lo¬ 
cations SOOOOOh through 801 FFFh are mapped to the expansion bus. When 
this region is accessed, MSTRB is active. Locations 802000h through 
803FFFh are reserved. Locations 804000h through 805FFFh are mapped to 
the expansion bus. When this region is accessed, lOSTRB is active. Locations 
806000h through 807FFFh are reserved. All of the memory-mapped periph¬ 
eral registers are in locations 808000h through 8097FFh. In both modes, 
RAM block 0 Is located at addresses 809800h through 809BFFh, and RAM 
block 1 Is located at addresses 809C00h through 809FFFh. Locations 
80A000h through OFFFFFFh are accessed over the external memory port 
(STRB active). 


In microprocessor mode, the 4K on-chip ROM is not mapped Into the 
TMS320C30 memory map. Locations Oh through 3Fh consist of interrupt 
vector, trap vector, and reserved locations, all of which are accessed over the 
external memory port (STRB active). Locations 40h through 7FFFFFh are also 
accessed over the external memory port. 


In microcomputer mode, the 4K on-chip ROM is mapped into locations Oh 
through OFFFh. There are 192 locations (Oh through BFh) within this block 
for Interrupt vectors, trap vectors, and a reserved space. Locations lOOOh 
through 7FFFFFh are accessed over the external memory port (STRB active). 

Section 4.2 describes the memory maps In greater detail. The peripheral bus 
map and the vector locations for reset, interrupts, and traps are also given. 
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Figure 3-4. Memory Maps 
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3.2.3 Memory Addressing Modes 


3 


The TMS320C30 supports a base set of general-purpose instructions as well 
as arithmetic-intensive Instructions that are particularly suited for digital signal 
processing and other numeric-intensive applications. Refer to Section 6 for 
detailed information on addressing. 

Five groups of addressing modes are provided on the TMS320C30. Six types 
of addressing may be used within the groups, as shown In the following list: 

• General addressing modes: 

Register. The operand is a CPU register. 

- Short Immediate. The operand is a 16-bit immediate value. 

“ Direct. The operand is the contents of a 24-blt address. 

- Indirect. An auxiliary register Indicates the address of the operand. 

• Three-operand addressing modes: 

- Register. Same as for general addressing mode. 

Indirect. Same as for general addressing mode. 

• Parallel addressing modes: 

Register. The operand is an extended-precision register. 

~ Indirect. Same as for general addressing mode. 

• Long-immediate addressing mode. 

- Long immediate. The operand is a 24-bit Immediate value. 

• Conditional branch addressing modes: 

“ Register. Same as for general addressing mode. 

- PC-relative. A signed 16-blt displacement Is added to the PC. 


3.2.4 Instruction Set Summary 

Table 3-2 lists the TMS320C30 instruction set in alphabetical order. Each 
table entry shows the instruction mnemonic, description, and operation. Refer 
to Section 11 for a functional listing of the Instructions and individual in¬ 
struction descriptions. 
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Table 3-2. Instruction Set Summary 


MNEMONIC 

DESCRIPTION 

OPERATION 

ABSF 

Absolute value of a floating-point 
number 

lsrc| Rn 

ABSI 

Absolute value of an integer 

|src| Dreg 

ADDC 

Add Integers with carry 

src + Dreg + C Dreg 

ADDC3 

Add integers with carry (3-operand) 

srcl + src2 + C Dreg 

ADDF 

Add floating-point values 

src + Rn Rn 

ADDF3 

Add floating-point values (3-operand) 

srcl + src2 -► Rn 

ADD! 

Add integers 

src + Dreg -► Dreg 

ADD13 

Add integers (3-operand) 

srcl + src2 + Dreg 

AND 

Bitwise logIcal-AND 

Dreg AND src Dreg 

AND3 

Bitwise logica(-AND (3-operand) 

srcl AND src2 Dreg 

ANDN 

Bitwise logIcal-AND with complement 

Dreg AND sm -► Dreg 

ANDN3 

Bitwise logical-ANDN (3-operand) 

srcl AND srcl Dreg 

ASH 

Arithmetic shift 

If count ^ 0: 

(Shift Dreg left by count) Dreg 

Else: 

(Shift Dreg right by |count|) Dreg 

ASH3 

Arithmetic shift (3-operand) 

If count ^ 0: 

(Shift src left by count) Dreg 

Else: 

(Shift src right by [countj) -* Dreg 

Bcond 

Branch conditionally (standard) 

If cond = true: 

If Csrc Is a register, Csrc -► PC 

If Csrc Is a value, Csrc + PC -► PC 

Else, PC + 1 PC 

^condD 

Branch conditionally (delayed) 

If cond = true: 

If Csrc is a register, Csrc PC 

If Csrc Is a value, Csrc + PC + 3 PC 

Else, PC + 1 PC 


LEGEND: 


src - general addressing modes 

src1 ” three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count ~ shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 


Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 
ARn - auxiliary register n (AR0-AR7) 
addr - 24-bit immediate address (label) 
cond - condition code (see Section 11) 
ST ~ status register 

RE - repeat interrupt register 

RS - repeat start register 

PC “ program counter 
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Table 3-2. Instruction Set Summary (Continued) 


MNEMONIC 

DESCRIPTION 

OPERATION 

BR 

Branch unconditionally (standard) 

Value PC 

BRD 

Branch unconditionally (delayed) 

Value -► PC 

CALL 

Call subroutine 

PC + 1 TOS 

Value “► PC 

CALLcond 

Call subroutine conditionally 

If cond = true; 

PC + 1 TOS 

If Csrc is a register, Csrc PC 

If Csrc is a value, Csrc + PC -► PC 

Else, PC + 1 -► PC 

CMPF 

Compare floating-point values 

Set flags on Rn - src 

CMPF3 

Compare floating-point values 
(3-operand) 

Set flags on src1 - src2 

CMPI 

Compare integers 

Set flags on Dreg - src 

CMPI3 

Compare Integers (3-operand) 

Set flags on srcl - src2 

DBconc/ 

Decrement and branch conditionally 
(standard) 

ARn - 1 ARn 

If cond = true and ARn ^ 0: 

If Csrc is a register, Csrc “► PC 

If Csrc is a value, Csrc + PC PC 

Else, PC + 1 PC 

DBcondD 

Decrement and branch conditionally 
(delayed) 

ARn - 1 ARn 

If cond = true and ARn 0: 

If Csrc is a register, Csrc -► PC 

If Csrc is a value, Csrc + PC + 3 ”► PC 

Else, PC + 1 PC 

FIX 

Convert floating-point value to integer 

Fix (src) -► Dreg 

FLOAT 

Convert integer to floating-point value 

Float(src) Rn 

IDLE 

Idle until interrupt 

PC + 1 PC 

Idle until next Interrupt 

LDE 

Load floating-point exponent 

src(exponent) Rn (exponent) 

LDF 

Load floating-point value 

src -► Rn 


LEGEND: 


src - general addressing modes 

srcl " three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc ~ conditional-branch addressing modes 

Sreg ~ register address (any register) 

count ~ shift value (general addressing modes) 

SP - stack pointer 

GIE ~ global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 


Dreg - register address (any register) 

Rn -- register address (R0-R7) 

Daddr - destination memory address 
ARn -• auxiliary register n (AR0-AR7) 
addr - 24-bit immediate address (label) 
cond “ condition code (see Section 11) 
ST - status register 

RE - repeat interrupt register 

RS “ repeat start register 

PC “ program counter 
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Table 3-2. Instruction Set Summary (Continued) 


MNEMONIC 

DESCRIPTION 

OPERATION 

LD?cond 

Load floating-point value 
conditionally 

If cond = true, src “► Rn 

Else, Rn is not changed 

LDFI 

Load floating-point value, 
interlocked 

Signal interlocked operation 
src Rn 

LDI 

Load integer 

src Dreg 

LD\cond 

Load integer conditionally 

If cond = true, src Dreg 

Else, Dreg is not changed 

LDM 

Load integer, interlocked 

Signal Interlocked operation 
src Dreg 

LDM 

Load floating-point mantissa 

src(mantlssa) -► Rn (mantissa) 

LSH 

Logical shift 

If count ^ 0: 

(Dreg left-shifted by count) -► Dreg 

Else: 

(Dreg right-shifted by Icountl) -♦ Dreg 

LSH3 

Logical shift (3-operand) 

If count ^ 0: 

(src left-shifted by count) Dreg 

Else: 

(src right-shifted by |count|) Dreg 

MPYF 

Multiply floating-point values 


MPYF3 

Multiply floating-point values 
(3-operand) 

src1 X src2 Rn 

MPYI 

Multiply integers 

src X Dreg Dreg 

MPYI3 

Multiply integers (3-operand) 

srcl X src2 Dreg 

NEGB 

Negate Integer with borrow 

0 - src - C Dreg 

NEGF 

Negate floating-point value 

0 - src Rn 

NEGI 

Negate integer 

0 - src Dreg 

NOP 

No operation 

Modify src If specified 

NORM 

Normalize floating-point value 

Normalize (src) -►Rn 

NOT 

Bitwise logical-complement 

src -► Dreg 

OR 

Bitwise logical-OR 

Dreg OR src -► Dreg 

OR3 

Bitwise logIcal-OR (3-operand) 

src1 OR src2 -► Dreg 


LEGEND: 


src 

- general addressing modes 

Dreg 

- register address (any register) 

src1 

- three-operand addressing modes 

Rn 

- register address (R0-R7) 

src2 

- three-operand addressing modes 

Daddr 

- destination memory address 

Csrc 

- conditional-branch addressing modes 

ARn 

- auxiliary register n (AR0-AR7) 

Sreg 

- register address (any register) 

addr 

- 24-bit immediate address (label) 

count 

- shift value (general addressing modes) 

cond 

- condition code (see Section 11) 

SP 

- stack pointer 

ST 

- status register 

GIE 

- global Interrupt enable register 

RE 

- repeat interrupt register 

RM 

- repeat mode bit 

RS 

- repeat start register 

TOS 

- top of stack 

PC 

- program counter 
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Table 3-2. Instruction Set Summary (Continued) 


MNEMONIC 

DESCRIPTION 

OPERATION 

POP 

Pop integer from stack 

*SP--* Dreg 

POPF 

Pop floating-point value from stack 

*SP—^ Rn 

PUSH 

Push integer on stack 

Sreg -+* + + SP 

PUSHF 

Push floating-point value on stack 

Rn * + + SP 

RETIcont/ 

Return from interrupt conditionally 

If cond = true or missing: 

*SP--> PC 

1 -► ST (GIE) 

Else, continue 

RETSco/?^/ 

Return from subroutine conditionally 

If cond = true or missing: 

*SP- -► PC 

Else, continue 

RND 

Round floating-point value 

Round (src) -♦ Rn 

ROL 

Rotate left 

Dreg rotated left 1 bit “► Dreg 

ROLC 

Rotate left through carry 

Dreg rotated left 1 bit through carry Dreg 

ROR 

Rotate right 

Dreg rotated right 1 bit Dreg 

RORC 

Rotate right through carry 

Dreg rotated right 1 bit thru carry -♦ Dreg 

RPTB 

Repeat block of Instructions 

src -► RE 

1 -► ST (RM) 

Next PC RS 

RPTS 

Repeat single instruction 

src -► RC 

1 ST (RM) 

Next PC RS 

Next PC RE 

SIGI 

Signal, interlocked 

Signal interlocked operation 

Wait for interlock acknowledge 

Clear Interlock 

STF 

Store floating-point value 

Rn Daddr 

STFI 

Store floating-point value, Interlocked 

Rn “► Daddr 

Signal end of interlocked operation 

STI 

Store Integer 

Sreg Daddr 

STM 

Store Integer, interlocked 

Sreg Daddr 

Signal end of interlocked operation 


LEGEND: 


src 

- general addressing modes 

Dreg 

- register address (any register) 

src1 

- three-operand addressing modes 

Rn 

“ register address (R0-R7) 

src2 

- three-operand addressing modes 

Daddr 

- destination memory address 

Csrc 

- conditional-branch addressing modes 

ARn 

- auxiliary register n (AR0-AR7) 

Sreg 

- register address (any register) 

addr 

- 24-bit immediate address (label) 

count 

- shift value (general addressing modes) 

cond 

- condition code (see Section 11) 

SP 

- stack pointer 

ST 

“ status register 

GIE 

~ global interrupt enable register 

RE 

- repeat Interrupt register 

RM 

- repeat mode bit 

RS 

~ repeat start register 

TOS 

- top of stack 

PC 

- program counter 
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Table 3-2. Instruction Set Summary (Continued) 


MNEMONIC 

DESCRIPTION 

OPERATION 

SUBB 

Subtract integers with borrow 

Dreg - src - C -► Dreg 

SUBB3 

Subtract integers with borrow 
(3-operand) 

srcl - src2 - C -► Dreg 

SUBC 

Subtract integers conditionally 

If Dreg - src ^ 0: 

[(Dreg-src) << 1] OR 1 Dreg 

Else, Dreg << 1 Dreg 

SUBF 

Subtract floating-point values 

Rn - src -► Rn 

SUBF3 

Subtract floating-point values 
(3-operand) 

src1 - src2 Rn 

SUB! 

Subtract integers 

Dreg - src -► Dreg 

SUBI3 

Subtract integers (3-operand) 

src1 - src2 Dreg 

SUBRB 

Subtract reverse integer with borrow 

src - Dreg - C Dreg 

SUBRF 

Subtract reverse floating-point value 

src - Rn Rn 

SUBRI 

Subtract reverse integer 

src - Dreg Dreg 

SWI 

Software interrupt 

Perform emulator interrupt sequence 

JHAPcond 

Trap conditionally 

If cond = true or missing: 

Next PC -► * ++ SP 

Trap vector N PC 

0 ST (GIE) 

Else, continue 

TSTB 

Test bit fields 

Dreg AND src 

TSTB3 

Test bit fields (3-operand) 

src1 AND src2 

XOR 

Bitwise exclusive-OR 

Dreg XOR src -♦ Dreg 

XOR3 

Bitwise exclusive-OR (3-operand) 

srcl XOR src2 Dreg 


LEGEND: 


src - general addressing modes 

srcl ~ three-operand addressing modes 

src2 ~ three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS “ top of stack 


Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 
ARn - auxiliary register n (AR0-AR7) 
addr - 24-bit immediate address (label) 
cond - condition code (see Section 11) 
ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 


MNEMONIC 

DESCRIPTION 

OPERATION 

PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS 

ABSF 

II STF 

Absolute value of a floating-point 

|src2l dst1 
!i src3 “► dst2 

ABSI 

II STI 

Absolute value of an integer 

|src2| -► dst1 

II src3 dst2 

ADDF3 

I! STF 

Add floating-point 

src1 + src2 -*• dst1 
tl src3 dst2 

ADDI3 

II STI 

Add integer 

srcl + src2 dst1 

II src3 dst2 

AND3 

II STI 

Bitwise logical-AND 

src1 AND src2 -► dst1 

11 src3 -► dst2 

ASH3 

II STI 

Arithmetic shift 

If count >: 0: 

src2 << count dst1 

11 src3 -► dst2 

Else: 

src2 >> |count| dst1 

11 src3 -► dst2 

FIX 

II STI 

Convert floating-point to integer 

Flx(src2) -► dst1 

II src3 dst2 

FLOAT 

II STF 

Convert integer to floating-point 

Float(src2) dst1 

II src3 “► dst2 

LDF 

II STF 

Load floating-point 

src2 dst1 

II src3 -►dst2 

LDI 

II STI 

Load integer 

src2 dst1 

II src3 -♦ dst2 

LSH3 

II STI 

Logical shift 

If count ^ 0: 

src2 << count “► dst1 

II src3 dst2 

Else: 

src2 >> Icountl -* dst1 

II src3 dst2 

MPYF3 

II STF 

Multiply floating-point 

srcl X src2 -► dst1 

II src3 -♦ dst2 

MPYI3 

II STI 

Multiply integer 

src1 X src2 dst1 

II src3 dst2 

NEGF 

II STF 

Negate floating-point 

0- src2 “► dst1 

II src3 dst2 


LEGEND: 

srcl - register addr (R0-R7) 
src3 - register addr (R0-R7) 
dsti - register addr (R0-R7) 


src2- indirect addr (disp = 0, 1, IRO, IR1) 
src4 - indirect addr (disp = 0, 1, IRO, IR1) 
dst2“ indirect addr (disp = 0, 1, IRO, IR1) 
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Table 3-2. Instruction Set Summary (Concluded) 


MNEMONIC 

DESCRIPTION 

OPERATION 

PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS (Concluded) | 

NEGI 

II STI 

Negate Integer 

0 - src2 dst1 

II src3 dst2 

NOT3 

II STI 

Complement 

src1 -► dst1 

11 src3 -* dst2 

OR3 

II STI 

Bitwise logical-OR 

srcl OR src2 dsti 
llsrcS “► dst2 

STF 

II STF 

Store floating-point 

src1 dst1 
llsrcS -► dst2 

STI 

II STI 

Store integer 

src1 dsti 

11 src3 dst2 

SUBF3 

II STF 

Subtract floating-point 

src1 - src2 dsti 

11 src3 “+ dst2 

SUBI3 

II STI 

Subtract integer 

srcl - src2 dsti 

11 src3 dst2 

XOR3 

II STI 

Bitwise exclusive-OR 

srcl XOR src2 dsti 

II src3 -*■ dst2 

PARALLEL LOAD INSTRUCTIONS 


Load floating-point 

src2 dsti 

11 src4 dst2 

LDI 

II ldi 

Load integer 

src2 dsti 

II src4 -♦ dst2 

PARALLEL MULTIPLY AND ADD/SUBTRACT INSTRUCTIONS 

MPYF3 

11 ADDF3 

Multiply and add floating-point 

opi X op2 op3 

11 op4 + op5 -♦ op6 

MPYF3 

11 SUBF3 

Multiply and subtract floating-point 

opi X op2 op3 

11 op4 - op5 -* op6 

MPYI3 

11 ADDIS 

Multiply and add integer 

opi X op2 op3 

11 op4 + op5 -► op6 

MPYI3 

11 SUBI3 

Multiply and subtract integer 

opi X op2 -► op3 

11 op4 - op5 -► op6 


LEGEND: 

srcl - register addr (R0-R7) 
src3 ~ register addr (R0-R7) 
dst1 - register addr (R0-R7) 
op3 - register addr (RO or R1) 


src2 - indirect addr (disp = 0, 1, IRQ, IR1) 
src4- indirect addr (disp = 0, 1, IRQ, IR1) 
dst2- indirect addr (disp = 0, 1, IRQ, IR1) 
op6 - register addr (R2 or R3) 

using register addr. 


op1,op2,op4,op5 - Two of these operands must be specified 
and two must be specified using indirect 
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3.3 internal Bus Operation 

A large portion of the TMS320C30's high performance is due to the internal 
busing and the parallelism possible because of this busing. The separate 
program buses (PADDR and PDATA), data buses (DADDR1, DADDR2, and 
□ DATA), and DMA buses (DMAADDR and DMADATA) allow for parallel 
program fetches, data accesses, and DMA accesses. These buses connect all 
of the physical spaces (on-chip memory, off-chip memory, and on-chip pe¬ 
ripherals) supported by the TMS320C30. 

The program counter (PC) is connected to the 24-bit program address bus 
(PADDR). The instruction register (IR) is connected to the 32-bit program 
data bus (PDATA). These buses can fetch a single Instruction word every 
machine cycle. 

The 24-blt data address buses (DADDR1 and DADDR2) and the 32-bit data 
data bus (DDATA) support two data memory accesses every machine cycle. 
The DDATA bus carries data to the CPU over the CPU1 and CPU2 buses. The 
CPU1 and CPU2 buses can carry two data memory operands to the multiplier, 
ALU, and register file every machine cycle. Also internal to the CPU are reg¬ 
ister buses REG1 and REG2 that can carry two data values from the register 
file to the multiplier and ALU every machine cycle. 

The DMA controller is supported with a 24-blt address bus (DMAADDR) and 
a 32-blt data bus (DMADATA). These buses allow the DMA to perform me¬ 
mory accesses in parallel with the memory accesses occurring from the data 
and program buses. 
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3.4 External Bus Operation 

The TMS320C30 provides two external interfaces: the primary bus and ex¬ 
pansion bus. Both consist of a 32-bit data bus and a set of control signals. 
The primary bus has a 24-bit address bus, whereas the expansion bus has a 
13-bit address bus. Both buses can be used to address external program/data 
memory or I/O space. The buses also have an external RDY signal for wait- 
state generation. Additional wait states may be inserted under software con¬ 
trol. Refer to Section 8 for detailed information on external bus operation. 

The TMS320C30 supports four external interrupts (iNTB-TITfO), a number of 
internal interrupts, and a nonmaskable external RESET signal. Two external I/O 
flags, XFO and XF1, can be configured as input or output pins under software 
control. These pins are also used by the interlocked operations of the 
TMS320C30. The interlocked-operations instruction group supports multi¬ 
processor communication (see Section 7 for examples of the use of inter¬ 
locked instructions). 
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3.5 Peripherals 


3 


All TMS320C30 peripherals are controlled through memory mapped registers 
on a dedicated peripheral bus, composed of a 32-bit data bus and a 24-bit 
address bus. This peripheral bus permits straightforward communication to 
the peripherals. The TMS320C30 peripherals Include two timers and two se¬ 
rial ports. Figure 3-5 shows the peripherals with associated buses and signals. 
Refer to Section 9 for detailed information on the peripherals. 
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Figure 3-5. Peripheral Modules 
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3.5.1 Timers 

The two timer modules are general-purpose 32-bit timer/event counters, with 
two signaling modes and internal or external clocking. Each timer has an I/O 
pin that can be used as an input clock to the timer or as an output signal dri¬ 
ven by the timer. The pin may also be configured as a general-purpose I/O 
pin. 

3.5.2 Serial Ports 

The two serial ports are totally Independent. They are Identical with a com¬ 
plementary set of control registers controlling each one. Each serial port can 
be configured to transfer 8, 16, 24, or 32 bits of data per word. The clock for 
each serial port can originate either internally or externally. An internally 
generated divide-down clock is provided. The serial port pins are configurable 
as general-purpose I/O pins. The serial ports can also be configured as timers. 
A special handshake mode allows TMS320C30s to communicate over their 
serial ports with guaranteed synchronization. 
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3.6 Direct Memory Access (DMA) 

The on-chIp Direct Memory Access (DMA) controller can read from or write 
to any location in the memory map without interfering with the operation of 
the CPU. Therefore, the TMS320C30 can interface to slow external memories 
and peripherals without reducing throughput to the CPU. The DMA controller 
contains its own address generators, source and destination registers, and 
transfer counter. Dedicated DMA address and data buses allow for minimi¬ 
zation of conflicts between the CPU and the DMA controller. A DMA opera¬ 
tion consists of a block or single-word transfer to or from memory. Refer to 
Section 9 for detailed information on the DMA. Figure 3-6 shows the DMA 
controller with associated buses. 



Figure 3-6. DMA Controller 


In summary, the TMS320C30 is a powerful DSP system because of its Inte¬ 
gration of a powerful CPU, large memories, and sufficient buses to support Its 
speed. These along with peripherals such as a DMA controller, two serial 
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ports, and two timers are all contained on a single chip. The total system real 
estate and price have been reduced, providing the user with a true single-chip 
solution. 
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Section 4 


CPU Registers, Memory, and Cache 


The CPU register file contains 28 registers that can be operated upon by the 
multiplier and ALU (arithmetic logic unit). Included In the register file are the 
auxiliary registers, extended-precision registers, and index registers. The reg¬ 
isters in the CPU register file support addressing, floating-point/integer oper¬ 
ations, stack management, processor status, block repeats, and interrupts. 

The TMS320C30 provides a total memory space of 16M (million) 32-bit 
words. Program, data, and I/O space are contained within this 16M-word 
address space. Two RAM blocks of 1 K x 32 bits each and a ROM block of 4K 
X 32 bits permit two accesses In a single cycle. The memory maps for the 
microcomputer and microprocessor modes are similar, except that the on-chip 
ROM is not used in microprocessor mode. 

A 64 X 32-bit instruction cache stores often repeated sections of code. This 
greatly reduces the number of off-chip accesses necessary and allows code to 
be stored off-chip in slower, lower-cost memories. Three bits are provided in 
the CPU status register to control the clear, enable, or freeze of the cache. 

This section describes In detail each of the CPU registers, the memory maps, 
and the Instruction cache. Major topics in this section are as follows: 

• CPU Register File (Section 4.1 on page 4-2) 

- Extended-precision registers (R0-R7) 

- Auxiliary registers (AR0-AR7) 

“ Index registers (IRQ, IR1) 

- Block size register (BK) 

- Data page pointer (DP) 

- System stack pointer (SP) 

- Status register (ST) 

“ CPU/DMA interrupt enable register (IE) 

- CPU Interrupt flag register (IF) 

- I/O flags register (lOF) 

“ Repeat counter (RC) and block repeat registers (RS, RE) 

- Program counter (PC) 

• Memory (Section 4.2 on page 4-11) 

- Memory maps 

- Peripheral bus map 

- Reset/interrupt/trap map 

• Instruction Cache (Section 4.3 on page 4-15) 

- Cache architecture 

- Cache algorithm 

- Cache control bits 
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4.1 CPU Register File 



The TMS320C30 provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. The PC is not included In the 28 registers. All of these 
registers can be operated upon by the multiplier and ALU, and can be used 
as general-purpose 32-bit registers. However, the registers also have some 
special functions for which they are more suited than others. For example, the 
eight extended-precision registers are especially suited for maintaining ex¬ 
tended-precision floating-point results. The eight auxiliary registers support 
a variety of Indirect addressing modes and can be used as general-purpose 
32-bit integer and logical registers. The remaining registers provide system 
functions such as addressing, stack management, processor status. Interrupts, 
and block repeat. Refer to Section 6 for detailed Information and examples 
of the use of CPU registers in addressing. 

The registers names and assigned function are listed in Table 4-1. 


Table 4-1. CPU Registers 


REGISTER 

ASSIGNED FUNCTION 

NAME 


RO 

Extended-precision register 0 

R1 

Extended-precision register 1 

R2 

Extended-precision register 2 

R3 

Extended-precision register 3 

R4 

Extended-precision register 4 

R5 

Extended-precision register 5 

R6 

Extended-precision register 6 

R7 

Extended-precision register 7 

ARO 

Auxiliary register 0 

AR1 

Auxiliary register 1 

AR2 

Auxiliary register 2 

AR3 

Auxiliary register 3 

AR4 

Auxiliary register 4 

ARB 

Auxiliary register 5 

AR6 

Auxiliary register 6 

AR7 

Auxiliary register 7 

DP 

Data page pointer 

IRQ 

Index register 0 

IR1 

Index register 1 

BK 

Block size 

SP 

System stack pointer 

ST 

Status register 

IE 

CPU/DMA Interrupt enable 

IF 

CPU interrupt flags 

lOF 

I/O flags 

RS 

Repeat start address 

RE 

Repeat end address 

RC 

Repeat counter 

PC 

Program counter 
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4.1.1 Extended-Precision Registers (R0-R7) 


The eight extended-precision registers (R0-R7) are capable of storing and 
supporting operations on 32-bit integer and 40-bit floating-point numbers. 
These registers consist of two separate and distinct regions. Bits 39-32 of the 
extended-precision registers are dedicated to the storage of the exponent (e) 
of the floating-point number. Bits 31-0 store the mantissa of the floating¬ 
point number. Bit 31 is the sign (s) bit bits 30 - 0 are the fraction (f). Any 
instruction that assumes the operands are floating-point numbers uses bits 
39-0. Figure 4-1 illustrates the storage of 40-bit floating-point numbers in the 
extended-precision registers. 

39 32 31 30 0 





mantissa 


Figure 4-1. Extended-Precision Register Floating-Point Format 


For integer operations, bits 31 -0 of the extended-precision registers contain 
the integer (signed or unsigned). Any instruction that assumes the operands 
are either signed or unsigned integers uses only bits 31 -0. Bits 39-32 remain 
unchanged. This is true for ail shift operations. The storage of 32-bit integers 
in the extended-precision registers is shown in Figure 4-2. 


39 32 31 


unchanged 


signed or unsigned integer 


Figure 4-2. Extended-Precision Register Integer Format 


4.1.2 Auxiliary Registers (AR0-AR7) 

The eight 32-bit auxiliary registers (AR0-AR7) can be accessed by the CPU 
and modified by the two Auxiliary Register Arithmetic Units (ARAUs). The 
primary function of the auxiliary registers is the generation of 24-bit addresses. 
However, they can also be used to perform a variety of functions, such as loop 
counters in indirect addressing or as 32-bit general-purpose registers that can 
be modified by the multiplier and ALU. Refer to Section 6 for detailed infor¬ 
mation and examples of the use of auxiliary registers in addressing. 
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4.1.3 Data Page Pointer (DP) 

The data page pointer (DP) is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64 k words long with a total of 256 
pages. Bits 31 - 8 are reserved and should always be kept zero by the user. 

4.1.4 Index Registers (IRQ, IR1) 

The 32-blt index registers (IRQ and IR1) are used by the Auxiliary Register 
Arithmetic Unit (ARAU) for indexing the address. Refer to Section 6 for de¬ 
tailed information and examples of the use of Index registers In addressing. 

4.1.5 Block Size Register (BK) 

The 32-bit block size register (BK) is used by the ARAU In circular addressing 
to specify the data block size (see Section 6.3). 

4.1.6 System Stack Pointer (SP) 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The SP always points to the last element pushed 
onto the stack. The SP is manipulated by Interrupts, traps, calls, returns, and 
the PUSH, PUSHF, POP, and POPF instructions. Pushes and pops of the 
stack perform pre-increment and post-decrement on ail 32 bits of the stack 
pointer. However, only the 24 LSBs are used as an address. Refer to Section 
6.5 for Information about system stack management. 

4.1.7 Status Register (ST) 

The status register (ST) contains global Information relating to the state of the 
CPU. Typically, operations set the condition flags of the status register ac¬ 
cording to whether the result is zero, negative, etc. This Includes register load 
and store operations as well as arithmetic and logical functions. When the 
status register is loaded, however,a bit-for-bit replacement is performed of the 
current contents with the contents of the source operand regardless of the 
state of any bits In the source operand. Therefore, following a load, the con¬ 
tents of the status register are identically equal to the contents of the source 
operand. This allows the status register to be easily saved and restored. At 
system reset, 0 Is written to this register. 

The format of the status register is shown in Figure 4-3. Table 4-2 defines the 
status register bits, their names and functions. 
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31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 



IQI 


IQI 

fBM 

IQ 

w 

mi 

— 

— 

IQJI 


imi 

m 

[mi 

15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

1— 

1^9 

ItH 


d 

d 

mi 

Q2I 

OVM 

Hil3 

d 

d 

O 


a 

1 c 1 



R/W 

R/W 

R/W 

R/W 


R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 


NOTE: XX = reserved bit. 

R = read, W = write. 


Figure 4-3. Status Register 
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Table 4-2. Status Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

C 

Carry flag 

1 

V 

Overflow flag 

2 

z 

Zero flag 

3 

N 

Negative flag 

4 

UF 

Floating-point underflow flag 

5 

LV 

Latched overflow flag 

6 

LUF 

Latched floating-point underflow flag 


OVM 

Overflow mode flag. This flag affects only the integer operations. If 
OVM = 0, the overflow mode Is turned off; integer results that over¬ 
flow are treated in no special way. If OVM = 1, integer results over¬ 
flowing in the positive direction are set to the most positive 32-bit 
two's-complement number (7FFFFFFFh). If OVM = 1, integer results 
overflowing in the negative direction are set to the most negative 
32-bit two's-complement number (SOOOOOOOh). Note that the func¬ 
tion of V and LV is Independent of the setting of OVM. 

8 

RM 

Repeat mode flag. If RM = 1, the PC is being modified in either the 
repeat block or repeat-single mode. 

9 

Reserved 

Read as 0. 

10 

CF 

Cache Freeze. When CF = 1, the cache Is frozen. If the cache is en¬ 
abled (CE = 1), fetches from the cache are allowed, but no modifica¬ 
tion of the state of the cache is performed. This function can be used 
to save frequently used code resident in the cache. At reset 0 is writ¬ 
ten to this bit. Cache clearing (CC=1) is allowed when CF=0. 

11 

CE 

Cache Enable. CE = 1 enables the cache, allowing the cache to be 
used according to the LRU cache algorithm. CE = 0 disables the 
cache; no update or modification of the cache can be performed. No 
fetches are made from the cache. This function is useful for system 
debug. At system reset 0 is written to this bit. Cache clearing (CC 
= 1) Is allowed when CE=0. 

12 

CC 

Cache Clear. CC = 1 Invalidates all entries in the cache. This bit is 
always cleared after it is written to and thus always read as 0. At reset, 

0 Is written to this bit. 

13 

GIE 

Global Interrupt enable. If GIE - 1, the CPU responds to an enabled 
interrupt. If GIE = 0, the CPU does not respond to an enabled inter¬ 
rupt. 


Reserved 

Read as 0. 

16-31 

Reserved 

Value undefined. 
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4.1.8 CPU/DMA Interrupt Enable Register (IE) 

The CPU/DMA interrupt enable register (IE) is a 32-bit register (see Figure 
4-4). The CPU interrupt enable bits are in locations 10-0. The DMA interrupt 
enable bits are in locations 26-16. A 1 in a CPU/DMA interrupt enable reg¬ 
ister bit enables the corresponding Interrupt. A 0 disables the corresponding 
interrupt. At reset, 0 is written to this register. Table 4-3 defines the register 
bits, the bit names, and the bit functions. 


31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 

■ 

B 

B 

B 

B 

EDINT 

(DMA) 

ETINT1 

(DMA) 

ETINTO 

(DMA) 

ERINT1 

(DMA) 

EXINT1 

(DMA) 

ERINTO 

(DMA) 

EXINTO 

(DMA) 

QIBBI 

EINT2 

(DMA) 

EINT1 

(DMA) 

EINTO 

(DMA) 






R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

B 

B 

B 

B 

B 

EDINT 

(CPU) 

ETINT1 

(CPU) 

ETINTO 

(CPU) 

ERINT1 

(CPU) 

EXINT1 

(CPU) 

ERINTO 

(CPU) 

EXINTO 

(CPU) 

EINT3 

(CPU) 

EINT2 

(CPU) 

EINT1 

(CPU) 

EINTO 

(CPU) 






R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 


NOTE: XX = reserved bit, read as 0. 

R = read, W = write. 

Figure 4-4. CPU/DMA Interrupt Enable Register (IE) 
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Table 4-3. IE Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

EINTO 

Enable external interrupt 0 (CPU) 

1 

EINT1 

Enable external interrupt 1 (CPU) 

2 

EINT2 

Enable external interrupt 2 (CPU) 

3 

EINT3 

Enable external interrupt 3 (CPU) 

4 

EXINTO 

Enable serial port 0 transmit interrupt (CPU) 

5 

ERINTO 

Enable serial port 0 receive interrupt (CPU) 

6 

EXINT1 

Enable serial port 1 transmit interrupt (CPU) 

7 

ERINT1 

Enable serial port 1 receive interrupt (CPU) 

8 

ETINTO 

Enable timer 0 interrupt (CPU) 

9 

ETINT1 

Enable timer 1 interrupt (CPU) 

10 

EDINT 

Enable DMA controller Interrupt (CPU) 

11-15 

Reserved 

Value undefined 

16 

EINTO 

Enable external interrupt 0 (DMA) 

17 

EINT1 

Enable external interrupt 1 (DMA) 

18 

EINT2 

Enable external interrupt 2 (DMA) 

19 

EINT3 

Enable external interrupt 3 (DMA) 

20 

EXINTO 

Enable serial port 0 transmit interrupt (DMA) 

21 

ERINTO 

Enable serial port 0 receive Interrupt (DMA) 

22 

EXINT1 

Enable serial port 1 transmit Interrupt (DMA) 

23 

ERINT1 

Enable serial port 1 receive interrupt (DMA) 

24 

ETINTO 

Enable timer 0 interrupt (DMA) 

25 

ETINT1 

Enable timer 1 Interrupt (DMA) 

26 

EDINT 

Enable DMA controller interrupt (DMA) 

27-32 

Reserved 

Value undefined 


4.1.9 CPU Interrupt Flag Register (IF) 

The 32-bit CPU interrupt flag register (IF) Is shown in Figure 4-5. A 1 In a 
CPU interrupt flag register bit indicates that the corresponding interrupt Is set. 
The IF bits are set to 1 when an interrupt occurs. They may also be set to 1 
through software to cause an Interrupt. A 0 indicates that the corresponding 
interrupt is not set. If a 0 is written to an Interrupt flag register bit, the corre¬ 
sponding interrupt Is cleared. At reset 0 Is written to this register. Table 4-4 
lists the bit fields, bit field names, and bit field functions of the CPU interrupt 
flag register. 
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31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 



Q| 

B1 



XX 

I I 

XX 

1 1 

XX 

XX 

IQI 

|[Q|| 

mm 

IQI; 

15 

14 

13 

12 

11 

10 
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8 

7 

6 

5 

4 

3 

2 

1 

0 

on 

on 


Q 


BIPH 

|tinti 

TINTO 

RINT1 

|XINT1 


XINTO 



it!tn 

0333 






R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 


NOTE: XX = reserved bit, read as 0. 
R = read, W = write. 


Figure 4-5. CPU Interrupt Flag Register (IF) 


Table 4-4. IF Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

INTO 

External interrupt 0 flag 

1 

INTI 

External interrupt 1 flag 

2 

INT2 

External interrupt 2 flag 

3 

INT3 

External Interrupt 3 flag 

4 

XINTO 

Serial port 0 transmit interrupt flag 

5 

RINTO 

Serial port 0 receive interrupt flag 

6 

XINT1 

Serial port 1 transmit interrupt flag 

7 

RINT1 

Serial port 1 receive interrupt flag 

8 

TINTO 

Timer 0 interrupt flag 

9 

TINT1 

Timer 1 interrupt flag 

10 

DINTO 

DMA channel interrupt flag 

11-31 

Reserved 

Value undefined 


4.1.10 I/O Flags Register (lOF) 

The I/O flags register (lOF) controls the function of the dedicated external 
pins, XFO and XF1. These pins may be configured for input or output (see 
Table 4-5). They may also be read from and written to. At reset, 0 is written 
to this register. The bit fields, bit field names, and bit field functions are shown 
in Table 4-5. 


31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 

09 

09 

90 

90 

09 

90 

09 

09 

XX 

XX 

1 1 

90 

XX 

XX 

1 1 

m 

15 

14 

13 

12 

11 

10 
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8 

7 

6 

5 

4 

3 

2 

1 

0 

09 

90 

09 

09 

09 

09 

09 

09 

INXF1 

OUTXF1 

I/OXFI 

09 

INXFO 

OUTXFO 

T/oxfo 

1001 









R 

R/W 

R/W 


R 

R/W 

R/W 



NOTE: XX = reserved bit, read as 0. 

R = read, W = write. 

Figure 4-6. I/O Flag Register (lOF) 
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Table 4-5. lOF Register Bits Summary 


ECO 

NAME 

FUNCTION 

0 

Reserved 

Read as 0. 

1 

T/OXFO 

If T/OXFO = 0, XFO is configured as a general-purpose input pin. 

If T/OXFO = 1, XFO Is configured as a general-purpose output pin. 

2 

OUTXFO 

Data output on XFO. 

3 

INXFO 

Data Input on XFO. A write has no effect. 

4 

Reserved 

Read as 0. 

5 

T/OXF1 

If T/OXFI = 0, XF1 is configured as a general-purpose Input pin. 

If T/OXF1 = 1, XF1 Is configured as a general-purpose output pin. 

6 

OUTXF1 

Data output on XF1. 

7 

INXF1 

Data Input on XF1. A write has no effect. 


Reserved 

Read as 0. 


4.1.11 Repeat Counter (RC) and Block Repeat Registers (RS, RE) 

The repeat counter (RC) is a 32-bit register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. 

The repeat start address register (RS) is a 32-bit register containing the start¬ 
ing address of the block of program memory to be repeated when operating 
In the repeat mode. 

The 32-blt repeat end address register (RE) contains the ending address of the 
block of program memory to be repeated when operating In the repeat mode. 

4.1.12 Program Counter (PC) 

The program counter (PC) is a 32-blt register containing the address of the 
next Instruction to be fetched. While the program counter is not part of the 
CPU register file. It Is a register that can be modified via instructions that mo¬ 
dify the program flow. 

4.1.13 Reserved Bits and Compatibility 

In order to retain compatibility with future members of the TMS320C3X family 
of microprocessors, reserved bits that are read as zero must be written as zero. 
Reserved bits that have an undefined value must not have their current value 
modified. In other cases, the user should maintain the reserved bits as speci¬ 
fied. 
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4.2 Memory 

The total memory space of the TMS320C30 is 16M (million) 32-bit words. 
Program, data, and I/O space are contained within this, allowing tables, co¬ 
efficients, program code, or data to be stored in either RAM or ROM. In this 
way, memory usage can be maximized and memory space allocated as desired. 

RAM blocks 0 and 1 are each 1 K x 32 bits. The ROM block is 4K x 32 bits. 
Each RAM and ROM block is capable of supporting two accesses in a single 
cycle. The separate program buses, data buses, and DMA buses allow for 
parallel program fetches, data reads/writes, and DMA operations. This is cov¬ 
ered in detail in Section 10.3. 

4.2.1 Memory Maps 

The memory map is dependent upon whether the processor is running In the 
microprocessor mode (MC,/MP = 0) or the microcomputer mode (MC/MP = 
1). The memory maps for these modes are very similar (see Figure 4-7). Lo¬ 
cations SOOOOOh through 801 FFFh are mapped to the expansion bus. When 
this region Is accessed, MSTRB is active. Locations 802000h through 
803FFFh are reserved. Locations 804000h through 805FFFh are mapped to 
the expansion bus. When this region Is accessed, lOSTRB is active. Locations 
806000h through 807FFFh are reserved. All of the memory-mapped periph¬ 
eral registers are In locations 808000h through 8097FFh. In both modes, 
RAM block 0 Is located at addresses 809800h through 809BFFh, and RAM 
block 1 Is located at addresses 809C00h through 809FFFh. Memory locations 
BOAOOOh through OFFFFFFh are accessed over the external memory port 
(STRB active). 

In microprocessor mode, the 4K on-chip ROM Is not mapped Into the 
TMS320C30 memory map. Locations Oh through 3Fh consist of interrupt 
vector, trap vector, and reserved locations, ail of which are accessed over the 
external memory port (STRB active). Locations 40h through 7FFFFFh are also 
accessed over the external memory port. 

In microcomputer mode, the 4K on-chip ROM is mapped into locations Oh 
through OFFFh. There are 192 locations (Oh through BFh) within this block 
for interrupt vectors, trap vectors, and a reserved space. Locations lOOOh 
through 7FFFFFh are accessed over the external memory port (STRB active). 

Reserved portions of the TMS320C30 memory space and reserved peripheral 
bus addresses should not be read and written by the user. Doing so may 
cause the TMS320C30 to halt operation and require a system reset to restart. 
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Oh 

BFh 

INTERRUPT LOCATIONS 
AND RESERVED (192) 
EXTERNAL STRS ACTIVE 

Oh 

BFh 

INTERRUPT LOCATIONS 
AND RESERVED (192) 

COh 


COh 

ROM 


EXTERNAL 

OFFFh 

(INTERNAL) 


STRB ACTIVE 

lOOOh 

EXTERNAL 

7FFFFFh 


7FFFFFh 

STRB ACTIVE 

SOOOOOh 

EXPANSION BUS 

SOOOOOh 

EXPANSION BUS 

SOIFFFh 

MSTRB ACTIVE (8K) 

SOIFFFh 

MSTRB ACTIVE (8K) 

802000h 

RESERVED 

802000h 

RESERVED 

SOSFFFh 

(8K) 

803FFFh 

(8K) 

804000h 

EXPANSION BUS 

804000h 

EXPANSION BUS 

805FFFh 

lOSTRB ACTIVE (8K) 

805FFFh 

lOSTRB ACTIVE (8K) 

806000h 

RESERVED 

806000h 

RESERVED 

807FFFh 

(8K) 

807FFFh 

(8K) 

SOSOOOh 

PERIPHERAL BUS 

808000h 

PERIPHERAL BUS 


MEMORY-MAPPED 


MEMORY-MAPPED 


REGISTERS 


REGISTERS 

8097FFh 

(INTERNAL) (6K) 

8097FFh 

(INTERNAL) (6K) 

809800h 

RAM BLOCK 0 (1K) 

809800h 

RAM BLOCK 0 (IK) 

809BFFh 

(INTERNAL) 

809BFFH 

(INTERNAL) 

809C00h 

RAM BLOCK 1 (1K) 

809C00h 

RAM BLOCK 1 (IK) 

809FFFh 

(INTERNAL) 

809FFFh 

(INTERNAL) 

80A000h 

EXTERNAL 

SOAOOOh 

EXTERNAL 

OFFFFFFh 

§TRB ACTIVE 

OFFFFFFh 

STRB ACTIVE 


MICROPROCESSOR MODE MICROPROCESSOR MODE 


Figure 4-7. Memory Maps 
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4.2.2 Peripheral Bus Map 

The memory-mapped peripheral registers are located starting at address 
808000h. The peripheral bus memory map is shown in Figure 4-8. Each pe¬ 
ripheral occupies a 16-word region of the memory map. Locations 80801 Oh 
through 80801 Fh and locations 808070h through 8097FFh are reserved. 

808000h 
80800Fh 

80801 Oh 
80801Fh 

808020h 
80802Fh 

808030h 
80803Fh 

808040h 
80804Fh 

808050h 
80805Fh 

808060h 
80806Fh 

808070h 
8097FFh 


DMA CONTROLLER REGISTERS 
_(16)_ 

RESERVED 

_ 06 )_ 

TIMER 0 REGISTERS 
_(16)_ 

TIMER 1 REGISTERS 

_ 06 )_ 

SERIAL PORT 0 REGISTERS 

_ 06 )_ 

SERIAL PORT 1 REGISTERS 
_ 06 )_ 

PRIMARY AND EXPANSION PORT 
REGISTERS (16) 


RESERVED 


Figure 4-8. Peripheral Bus Memory Map 


4.2.3 Reset/Interrupt/Trap Vector Map 

The addresses for the reset interrupt and trap vectors are Oh through 3Fh, as 
shown in Figure 4-9. The vectors stored in these locations are the addresses 
of the start of the respective reset interrupt and trap routines. For example, 
at reset the contents of memory location Oh (the reset vector) are loaded into 
the PC and execution begins from that address. 

Traps 28-31 are reserved and should not be used by the user. 
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OOh 

RESET 

01 h 

INTO 

02h 

INTI 

03h 

INT2 

04h 

INT3 

05h 

XINTO 

06h 

RINTO 

07 h 

XINTO 

08h 

RINT1 

09h 

TINTO 

OAh 

TINT1 

OBh 

DINT 

OCh 

RESERVED 

IFh 


20h 

TRAPO 



3Bh 

TRAP 27 

3Ch 

TRAP 28 (Reserved) 

3Dh 

TRAP 29 (Reserved) 

3Eh 

TRAP 30 (Reserved) 

3Fh 

TRAP 31 (Reserved) 


Figure 4-9. Reset, Interrupt, and Trap Vector Locations 
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4.3 Instruction Cache 

A 64 X 32-bit instruction cache allows for maximum system performance with 
minimal system cost. The instruction cache stores sections of code that can 
be fetched when repeatedly accessing time-critical code. This greatly reduces 
the number of off-chip accesses necessary and allows for code to be stored 
off-chip in slower, lower-cost memories. The external buses are also freed 
from program fetches, so they can be used by the DMA or other system ele¬ 
ments. 

The cache can operate In a completely automatic fashion without the need for 
user intervention. A form of the LRU (least-recently-used) cache update al- h 
gorithm Is used (see Section 4.3.2). H 

4.3.1 Cache Architecture 

The instruction cache (see Figure 4-10) contains 64 32-bit words of RAM. 

The cache Is divided into two 32-word segments. Associated with each seg¬ 
ment is a 19-bit segment start address (SSA) register. For each word In the 
cache, there is a corresponding single-bit: Present (P) flag. 


SEGMENT START 
ADDRESS REGISTERS 


/— 

^_ 

—^ 

□ 

SSA REGISTER 0 

□ 


!-■- 19 -^ 


SSA REGISTER 1 


P 

FLAGS 


SEGMENT WORDS 


32 


0 


SEGMENT WORD 0 


1 


SEGMENT WORD 1 


: 

i 


30 


SEGMENT WORD 30 


31 


SEGMENT WORD 31 



> SEGMENT 0 


0 


SEGMENT WORD 0 

'v 

1 


SEGMENT WORD 1 


: 

i 


30 


SEGMENT WORD 30 


31 


SEGMENT WORD 31 

z' 


S SEGMENT 1 


LRU 



MOST RECENTLY USED 
SEGMENT NUMBER 


LEAST RECENTLY USED 
SEGMENT NUMBER 


Figure 4-10. Instruction Cache Architecture 


When the CPU requests an instruction word from external memory, a check 
is made to determine If the word is already contained in the instruction cache. 
The partitioning of an instruction address as used by the cache control algo¬ 
rithm is shown in Figure 4-11. The 19 most-significant bits of the instruction 
address are used to select the segment and the 5 least-significant bits define 
the address of the instruction word within the pertinent segment. The 19 
MSBs of the Instruction address are compared with the two segment start 
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address (SSA) registers. If a match is found, a check is made of the relevant 
P flag. The P flag indicates whether or not the word within a particular seg¬ 
ment Is already present In cache memory. 


23 5 4 0 


segment start address 

instruction word 

(SSA) 

address within segment 


Figure 4-11. Address Partitioning for Cache Control Algorithm 


If there is no match, one of the segments must be replaced by the new data. 
The segment replaced in this circumstance is determined by the LRU (least- 
recently-used) algorithm. The LRU stack (see Figure 4-10) is maintained for 
this purpose. 

The LRU stack tracks which of the two segments qualifies as the least-re- 
cently-used after each access to the cache, therefore the stack contains either 
0,1 or 1,0. Each time a segment is accessed. Its segment number is removed 
from the LRU stack and pushed on the top of the LRU stack. Therefore, the 
number at the top of the stack Is the most-recently-used segment number and 
the number at the bottom of the stack is the least-recently-used segment 
number. 

At system reset, the LRU stack Is Initialized with 0 at the top, 1 at the bottom, 
and all P flags in the instruction cache are cleared. If both SSA registers are 
equal (due to system reset conditions) and a cache hit occurs, the Instruction 
word is fetched from the most recently used segment. 

When a replacement Is necessary, the least-recently-used segment Is selected 
for replacement. Also, the 32 P flags for the segment to be replaced are set 
to 0, and the segment's SSA register is replaced with the 19 MSBs of the In¬ 
struction address. ,* 

4.3.2 Cache Algorithm 

When the TMS320C30 requests an Instruction word from external memory, 
two possible actions occur: a cache hit or a cache miss. These are described 
in the following list: 

• Cache Hit. The requested instruction Is contained within the cache 
and the following actions occur: 

1) The Instruction word is read from the cache. 

2) The segment number of the segment within which the word Is 
contained is removed from the LRU stack and pushed to the top 
of the LRU stack, thus moving the other segment number to the 
bottom of the stack. 

• Cache Miss. The instruction is not contained In the cache. Types of 
cache miss are: 

1) Word Miss. The segment address register matches the Instruction 
address, but the relevant P flag Is not set. The following actions 
occur in parallel: 
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- The instruction word is read from memory and copied into 
the cache. 

- The segment number of the segment within which the word 
is contained is removed from the LRU stack and pushed to 
the top of the LRU stack, thus moving the other segment 
number to the bottom of the stack. 

- The relevant P flag Is set. 

2) Segment Miss. Neither of the segment addresses matches the in¬ 
struction address. The following actions occur In parallel: 

- The least-recently-used segment Is selected for replacement. 
The P flags for all 32 words are cleared. 

- The SSA register for the selected segment Is loaded with the 
19 MSBs of the address of the requested Instruction word. 

- The Instruction word Is fetched and copied Into the cache. 
It goes into the appropriate word of the least-recently-used 
segment. The P flag for that word is set 1. 

~ The segment number of the segment containing the Instruc¬ 
tion word Is removed from the LRU stack and pushed to the 
top of the LRU stack, thus moving the other segment number 
to the bottom of the stack. 

Only Instructions may be fetched from the program cache. All reads and writes 
of data In memory bypass the cache. Program fetches from Internal memory 
do not modify the cache and will not generate cache hits or misses. The pro¬ 
gram cache Is a single-access memory block. Dummy program fetches (i.e., 
following a branch) are treated by the cache as valid program fetches and can 
generate cache misses and cache updates. 

Care should be taken when using self-modifying code. If an instruction re¬ 
sides In cache and the corresponding location in primary memory is modified, 
the copy of the Instruction in cache is not modified. 

More efficient use of the cache can be made by aligning program code on 32 
word address boundaries. This can be done using the ALIGN directive when 
coding assembly language. 

4.3.3 Cache Control Bits 

Three cache control bits are located In the CPU status register: the cache clear 
bit (CC), cache enable bit (CE), and the cache freeze bit (CF). 

Cache Clear Bit (CC). Writing a 1 to the cache clear bit (CC) invalidates 
ail entries In the cache. All P flags in the cache are cleared. The CC bit is al¬ 
ways cleared after the cache is cleared. It is therefore always read as a 0. At 
reset the cache is cleared and 0 is written to this bit. 

Cache Enable Bit (CE). Writing a 1 to this bit enables the cache. When 
enabled, the cache is used according to the previously described cache algo¬ 
rithm. Writing a 0 to the cache enable bit disables the cache; no updates or 
modification of the cache can be performed. Specifically, no SSA register 
updates are performed, no P flags are modified (unless CC = 1), and the LRU 
stack is not modified. Writing a 1 to CC when the cache is disabled will clear 
the cache, and thus the P flags. No fetches are made from the cache when the 
cache is disabled. At reset, 0 is written to this bit. 
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Cache Freeze Bit (CF). When CF = 1, the cache is frozen. If, in addition, 
the cache is enabled, fetches from the cache are allowed, but no modification 
of the state of the cache Is performed. Specifically, no SSA register updates 
are performed, no P flags are modified (unless CC = 1), and the LRU stack is 
not modified. This function can be used to keep frequently used code resident 
in the cache. Writing a 1 to CC when the cache is frozen will clear the cache, 
and thus the P flags. At reset, 0 is written to this bit. 
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Table 4-6 defines the effect of the CE and CF bits used in combination. 

Table 4-6. Combined Effect of the CE and CF Bits 


CE 

CF 

EFFECT 

0 

0 

Cache not enabled 

0 

1 

Cache not enabled 

1 

0 

Cache enabled and not frozen 

1 

1 

Cache enabled and frozen 


4-19 




Instruction Cache - Cache Architecture/Algorithm 


4-20 































Section 5 


Data Formats and Floating-Point Operation 


Data is organized in the TMS320C30 architecture to provide three funda¬ 
mental data types: integer, unsigned-integer, and floating-point. Note that the 
terms, integer and signed-integer, are considered to be equivalent. The 
TMS320C30 supports short and single-precision formats for signed and un¬ 
signed integers. It also supports short, single-precision and extended- 
precision formats for floating-point data. 

Floating-point operations provide convenient and trouble-free computations 
while maintaining accuracy and precision. The TMS320C30 implementation 
of floating-point arithmetic allows for floating-point operations at integer 
speeds. The floating-point capability can prevent problems with overflow, 
operand alignment, and other burdensome tasks common in Integer oper¬ 
ations. 

This section discusses in detail the data formats and floating-point operations 
supported on the TMS320C30. Major topics in this section are as follows: 

• Integer Formats (Section 5.1 on page 5-2) 

• Unsigned-Integer Formats (Section 5.2 on page 5-3) 

• Floating-Point Formats (Section 5.3 on page 5-4) 

• Floating-Point Multiplication (Section 5.4 on page 5-9) 

• Floating-Point Addition and Subtraction (Section 5.5 on page 5-13) 

• Normalization (Section 5.6 on page 5-17) 

• Rounding (Section 5.7 on page 5-20) 

• Floating-Point to Integer Conversions (Section 5.8 on page 5-22) 

• Integer to Floating-Point Conversions (Section 5.9 on page 5-24) 
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5.1 Integer Formats 

The TMS320C30 supports two integer formats: a 16-bit short integer format 
and a 32-bit single-precision integer format. When extended-precision regis¬ 
ters are used as integer operands only bits 31 -0 are used; bits 39-32 remain 
unchanged and unused. 


5.1.1 Short Integer Format 



The short integer format is a 16-bit two's-complement integer format used for 
immediate integer operands. For those instructions that assume integer oper¬ 
ands, this format is sign-extended to 32 bits (see Figure 5-1). The range of 
an integer si, represented in the short integer format is -2^® ^ si ^ 2^^ -1. 
In Figure 5-1, s=signed bit 


15 


31 


Short Integer Format 
16 15 0 


ssssssssssssss 


5 5 


Sign Extension of a Short Integer 


Figure 5-1. Short Integer Format and Sign Extension of Short 

Integer 


5.1.2 Single-Precision Integer Format 

In the single-precision integer format the integer is represented in two's- 
complement notation. The range of an integer sp, represented in the single¬ 
precision integer format, is -2^^ ^ 5p < 2^^ -1. Figure 5-2 shows the 
single-precision integer format 

31 0 


Figure 5-2. Single-Precision Integer Format 
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5.2 Unsigned-Integer Formats 

Two unsigned-integer formats are supported on the TMS320C30: a 16-bit 
short format and a 32-bit single-precision format. In extended-precision reg¬ 
isters, the unsigned-integer operands use only bits 31-0; bits 39-32 remain 
unchanged. 

5.2.1 Short Unsigned-Integer Format 

Figure 5-3 shows the 16-bit short unsigned-integer format used for immedi¬ 
ate unsigned-integer operands. For those instructions that assume un¬ 
signed-integer operands, this format is zero-filled to 32 bits. In Figure 5-3 
below, X = MSB (1 orO). 


15 


31 


Short Unsigned-Integer 
Format 

16 15 0 


00000000000000 


X 


Zero Fill of a Short Unsigned Integer 
Figure 5-3. Short Unsigned-Integer Format and Zero Fill 


5.2.2 Single-Precision Unsigned-Integer Format 

In the single-precision unsigned-integer format the number Is represented as 
a 32-blt value, as shown In Figure 5-4. 

31 0 


Figure 5-4. Single-Precision Unsigned-Integer Format 
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5.3 Floating-Point Formats 

All TMS320C30 floating-point formats consist of three fields: an exponent 
field (e), a single sign-bit field (s), and a fraction field (/). These are stored 
as shown in Figure 5-5. The exponent field Is a two's-complement number. 
The sign field and fraction field may be considered as one unit and referred to 
as the mantissa field {man). The mantissa is used to represent a normalized 
two's-complement number. In a normalized representation, a most-signifi¬ 
cant nonsign bit is implied, thus providing an additional bit of precision. The 
value of a floating-point number x as a function of the fields e, s, and f is given 
as 


X = 01./ X 2® 

10.f X 2® 


5 


if 5 = 0 
if 5 = 1 

If e = most negative two's-complement value 
for the specified exponent field width. 



t- 


man (mantissa) 



Figure 5-5. Generic Fioating-Point Format 


Three floating-point formats are supported on the TMS320C30. The first is a 
short floating-point format for immediate floating-point operands, consisting 
of a 4-blt exponent, 1 sign bit, and an 11-bit fraction. The second is a sin¬ 
gle-precision format consisting of an 8-blt exponent, 1 sign bit, and a 23-blt 
fraction. The third is an extended-precision format consisting of an 8-bit ex¬ 
ponent, 1 sign bit, and a 31 -bit fraction. 


5.3.1 Short Floating-Point Format 


In the short floating-point format, floating-point numbers are represented by 
a two's-complement 4-bit exponent field (e) and a two's-complement 12-bit 
mantissa field (man) with an implied most-significant nonsign bit. 


15 


12 


1l|l0 


0 



Figure 5-6. Short Floating-Point Format 
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Operations are performed with an implied binary point between bits 11 and 
10. When the implied most-significant nonsign bit is made explicit it is lo¬ 
cated to the immediate left of the binary point. The floating-point two's- 
complement number x in the short floating-point format is given by 

X = 01./ X 2® If s = 0 

10./ X 2® If 5 = 1 

0 if e = -8, s = 0, / = 0 


The following reserved values must be used to represent zero in the short 
floating-point format: 

e = - 8 
s = 0 
/= 0 


The following examples illustrate the range and precision of the short float¬ 
ing-point format: 

Most Positive: x = (2 - x 2 ^ = 2.5594 x 10^ 

Least Positive: x = 1 x 2'^ = 7.8125 x 10 

Least Negative: x = (-1-2'") x 2 = -7.8163 x lO’^ 

Most Negative: x = -2 x 2^ = -2.5600 x 10 ^ 


5 


5.3.2 Single-Precision Floating-Point Format 

In the single-precision format, the floating-point number is represented by an 
8-bit exponent field (e) and a two's-complement 24-bit mantissa field {man) 
with an implied most-significant nonsign bit. 

Operations are performed with an implied binary point between bits 23 and 
22. When the implied most-significant nonsign bit is made explicit. It is lo¬ 
cated to the Immediate left of the binary point. The floating-point number x 
Is given by: 

X = 01./ X 2® If s = 0 

1 Q.f X 2® if s = 1 

0 If e = -128, s = 0,/= 0 

31 241 23 ]22 0 


e s f 


man- 

Figure 5-7. Single-Precision Floating-Point Format 


The following reserved values must be used to represent zero In the single- 
precision floating-point format: 

e = - 128 
5 = 0 
f=0 
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The following examples illustrate the range and precision of the single-preci¬ 
sion floating-point format 


Most Positive: 
Least Positive: 
Least Negative: 
Most Negative: 


X = (2 - 2-23) x2 1 27 = 3.4028234 x lO^S 
X = 1 X 2-127 = 5.8774717 x 10 -29 
X = (-1-2-23) X 2 -127 = -5.8774724 x 10-29 
X = -2 X 2127 = -3.4028236 x lO^S 


5.3.3 Extended-Precision Floating-Point Format 


5 


In the extended-precision format the floating-point number is represented by 
an 8-bit exponent field (e) and a 32-bit mantissa field {man) with an implied 
most-significant nonsign bit 

Operations are performed with an implied binary point between bits 31 and 
30. When the implied most-significant nonsign bit is made explicit it is lo¬ 
cated to the Immediate left of the binary point The floating-point number x 
is given by 


X = 01./ X 2 ® 
1 0.f X 2® 
0 


If 5 = 0 
if s = 1 

If e = -128, 5 = 0,/ = 0 


39 


32 31 30 


Figure 5-8. Extended-Precision Floating-Point Format 


The following reserved values must be used to represent zero in the extend¬ 
ed-precision floating-point format 

e = - 128 
s = 0 
/= 0 


The following examples illustrate the range and precision of the extended- 
precision floating-point format: 


Most Positive: 
Least Positive: 
Least Negative: 
Most Negative: 


X = (2 - 2-31) x2 127 = 3.4028236683 x 1038 
X = 1 X 2-127 = 5.8774717541 x 10 -39 
x= (-1-2-31) X 2-127 = -5.8774717569 x 10-39 
X = -2 X 2127 = -3.4028236691 x 10 38 
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5.3.4 Conversion Between Floating-Point Formats 

Floating-point operations assume several different formats for inputs and 
outputs. These formats often require conversion from one floating-point for¬ 
mat to another (e.g., short floating-point format to extended-precision float¬ 
ing-point format). Format conversions automatically occur in hardware, with 
no overhead, as a part of the floating-point operations. The four conversions 
are shown below with examples of the conversion. When a floating-point 
format zero is converted to a greater-precision format, it is always converted 
to a valid representation of zero in that format. In the below figures, S = sign 
bit of the exponent. 




Short floating-point format conversion to single-precision 
floating-point format. 


15 


12 11 10 


y y 


Short Floating-Point Format 


5 


31 


27 


24 23 22 


12 11 


SSSxxx 


y 0 


Single-Precision Floating-Point Format 


In this format, the exponent field Is sign-extended and the fraction field 
filled with zeros. 


Short floating-point format conversion to extended-precision 
floating-point format. 


15 


1211 10 


S XX 


Short Floating-Point Format 
39 35 32 31 30 20 19 


S SSSxxx X y y 


y 0 


Extended-Precision Floating-Point Format 


The exponent field In this format Is sign-extended and the fraction field 
filled with zeros. 
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• Single-precision floating-point format conversion to extend¬ 
ed-precision floating-point format. 

31 24 23 22 0 



Single-Precision Floating-Point Format 


39 32 31 30 8 7 0 



Extended-Precision Floating-Point Format 


The fraction field is filled with zeros. 


• Extended-precision floating-point format conversion to sin¬ 
gle-precision floating-point format. 

39 32 31 30 8 7 0 



Extended-Precision Floating-Point Format 


31 24 23 22 0 



Single-Precision Floating-Point Format 


The fraction field is truncated. 
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5.4 Floating-Point Multiplication 

A floating-point number a can be written in floating-point format as the fol¬ 
lowing formula, where a( man) is the mantissa and a(exp) is the exponent. 

a = a{man) x 

The product of a and b is c, defined as 

c = a X b = a(man) x b{man) x 2 {exp)) 

c{man) = a{man) x b{man) 
c(exp) = a (exp) + b(exp) 

When performing floating-point multiplication, source operands are always 
assumed to be in the single-precision floating-point format. If the source of 
the operands Is in short floating-point format, it is extended to the single¬ 
precision floating-point format. If the source of the operands is in extend¬ 
ed-precision floating-point format. It is truncated to single-precision format. 
These conversions automatically occur in hardware with no overhead. All re¬ 
sults of floating-point multiplications are In the extended-precision format. 
These multiplications occur in a single cycle. 

A flowchart for floating-point multiplication is shown In Figure 5-9. In step 
1, the 24-bit source operand mantissas are multiplied, producing a 50-bit re¬ 
sult c{man). (Note that Input and output data are always represented as nor¬ 
malized numbers.) In step 2, the exponents are added, yielding c(exp). Steps 
3 through 6 check for special cases. Step 3 checks for whether c{man) In 
extended-precision format is equal to zero. If c(man) is zero, step 7 sets 
c(exp) to -128, thus yielding the representation for zero. 

Steps 4 and 5 normalize the result. If a right shift of one Is necessary, then In 
step 8, c(man) Is right-shifted one bit and one Is added to c(exp). If a right 
shift of two is necessary, then in step 9, c{man) Is right-shifted two bits and 
two is added to c{exp). Step 6 occurs when the result is normalized. 

In step 10, c{man) is set in the extended-precision floating-point format. 
Steps 11 through 18 check for special cases of c(exp). In step 14, If c(exp) 
has overflowed (step 11) in the positive direction, then c{exp) Is set to the 
most-positive extended-precision format value. If c{exp) has overflowed In 
the negative direction, then c(exp) is set to the most-negative extended-pre¬ 
cision format value. If c(exp) has underflowed (step 12), then c Is set to zero 
(step 15); i.e., c(man) = 0 and c(exp) - -128. 
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a(man) b(man) a(exp) b(exp) 



c = a * b 


Figure 5-9. Flowchart for Floating-Point Multiplication 
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The following examples illustrate how floating-point multiplication is per¬ 
formed on the TMS320C30. For these examples, the implied most-significant 
nonsign bit is made explicit. 

Example 5-1. Floating-Point Multiply (Both Mantissas = -2.0) 


a = -2.0 X 23(exp) = 10.00000000000000000000000 x 2 ^(exp) 
b = -2.0 X = 10.00000000000000000000000 x 2 


where a and b are both represented in binary form according to the normalized single-pre¬ 
cision floating-point format. Then 


10.00000000000000000000000 X 2a<e^/’) 

X 10.00000000000000000000000 X _ 

0100.0000000000000000000000000000000000000000000000 X 2(a( + b(exp)) 
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To place this number in the proper normalized format, it is necessary to shift the mantissa two 
places to the right and add two to the exponent. This yields 


10.00000000000000000000000 X 2a(®^P) 

X 10.00000000000000000000000 X _ 

01.0000000000000000000000000000000000000000000000 X 2<3( + t)(ejfp)+2) 


In floating-point multiplication, the exponent of the result may overflow. This can occur 
when the exponents are initially added or when the exponent Is modified during normaliza¬ 
tion. 


Example 5-2. Floating-Point Multiply (Both Mantissas = 1.5) 

Let 

a = 1 .5 X 2a(®^P) = 01.10000000000000000000000 x 2 
b = 1.5 X = 01.10000000000000000000000 x 2 

where a and b are both represented in binary form according to the single-precision float¬ 
ing-point format. Then 

01.10000000000000000000000 X 2a(e-^P) 

X 01.10000000000000000000000 X 

0010.0100000000000000000000000000000000000000000000 X 2 (a(exp) + b(exp)) 

To place this number In the proper normalized format, it Is necessary to shift the mantissa 
one place to the right and add one to the exponent. This yields 

01.10000000000000000000000 X 2^(BXP) 

X 01.10000000000000000000000 X _ 

01.00100000000000000000000000000000000000000000000 X 2 (a(exp) +b(exp) +1) 


5-11 






Floating-Point Operations - Multiplication 


Example 5-3. Floating-Point Multiply (Both Mantissas = 1.0) 

Let 

a = 1.0 x23(^^P) = 01.00000000000000000000000 x 2 
b = 1.0 x2b(^^P) = 01.00000000000000000000000 x 2 

where a and b are both represented in binary form according to the single-precision float¬ 
ing-point format. Then 

01.00000000000000000000000 X 2^(exp) 

X 01.00000000000000000000000 X _ 

0001.0000000000000000000000000000000000000000000000 X 2(3(exp) +b(ex/j)) 

This number is in the proper normalized format. Therefore, no shift of the mantissa or mod¬ 
ification of the exponent is necessary. 

These examples have shown cases where the product of two normalized numbers can be 
normalized with a shift of zero, one, or two. For all normalized inputs with the floating-point 
format used by the TMS320C30, a normalized result can be produced by a shift of zero, one, 
or two. 

Example 5-4. Floating-Point Multiply Between Positive and Negative Numbers 
Let 

a = 1.0 = 01.00000000000000000000000 x 2 

b = 2.0 x2‘>(«'^P) = 10.00000000000000000000000 x 2 

Then 

01.00000000000000000000000 X 2a(e>fP) 

X 10.00000000000000000000000 X 2^(exp) _ 

1110.0000000000000000000000000000000000000000000000 X 2 + b(exp)) 

The result is c = -2.0 x 2^^(^xp) + b(exp)) 

Example 5-5. Floating-Point Multiply by Zero 

All multiplications by a floating-point zero yield a result of zero (/ = 0, 5 = 0, and exp = 
-128). 
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5.5 Floating-Point Addition and Subtraction 

In floating-point addition and subtraction, two floating-point numbers a and 
b can be defined as 

a = a(man) x 
b = b{man) x 

The sum (or difference) of a and b can be defined as 

^ = (atna/7) ± {b{man) x 2 x 2 ^(exp)^ 
if a (exp) ^ b(exp) 

= {{a(man) x 2 "(b(®^P)"3(exp))j ^ man)) x 2 •^(^xp) 
if a(exp) < b(exp) 

The flowchart for floating-point addition is shown in Figure 5-10. Since this 
flowchart assumes signed data, it is also appropriate for floating-point sub¬ 
traction. In this figure, it is assumed that a(exp) ^ b(exp). In step 1, the 
source exponents are compared, and c(exp) is set equal to the largest of the 
two source exponents. In step 2, d is set to the difference of the two expo¬ 
nents. In step 3, the mantissa with the smallest exponent. In this case 
a{man), is right-shifted d bits in order to align the mantissas. After the man¬ 
tissas have been aligned, they are added (step 4). 

Steps 5 through 7 check for a special case of c{man). If c(man) is zero (step 
5), then c(exp) is set to its most-negative value (step 8) to yield the correct 
representation of zero. If c{man) has overflowed c (step 6), then in step 9, 
c{man) is right-shifted one bit and one Is added to c(exp) In step 10, the 
result Is normalized. In steps 11 and 12, special cases of c(exp) are tested. 
If c(exp) has overflowed, then c is set to the most-positive extended-precision 
value if it is positive; otherwise, it Is set to the most-negative extended-preci¬ 
sion value. 
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a(man) b(man) a(exp) b(exp) 



c == a + b 


Figure 5-10. Flowchart for Floating-Point Addition 








Floating-Point Operations - Addition/Subtraction 


The following examples describe the floating-point addition and subtraction 
operations. It Is assumed that the data Is in the extended-precision floating¬ 
point format. 

Example 5-6. Floating-Point Addition 


In the case of two normalized numbers to be summed, let 

a = 1.5 = 01.1000000000000000000000000000000 x 2° 
b = 0.5 = 01.0000000000000000000000000000000 X 2-1 

It is necessary to shift b to the right by one so that a and b have the same 
exponent. This yields 

b = 0.5 = 00.1000000000000000000000000000000 X 2° 

Then 

01.1000000000000000000000000000000 X 2° 

+ 00.1000000000000000000000000000000 X 2 ° 

010.0000000000000000000000000000000 X 2° 

As In the case of multiplication, it is necessary to shift the binary point one 
place to the left and to add one to the exponent. This yields 

01.1000000000000000000000000000000 X 2° 

+ 00.1000000000000000000000000000000 X 2 0 

01.0000000000000000000000000000000 X 2l 

Example 5-7. Floating-Point Subtraction 

A subtraction Is performed in this example. Let 

a = 01.0000000000000000000000000000001 X 2° 
b = 01.0000000000000000000000000000000 X 2° 

The operation to be performed is a - b. The mantissas are already aligned since 
the two numbers have the same exponent. The result is a large cancellation 
of the upper bits, as shown below. 

01.0000000000000000000000000000001 X 2° 

- 01.0000000000000000000000000000000 X 2 ° 

00.0000000000000000000000000000001 X 2° 

The result must be normalized. In this case, a left-shift of 31 is required. The 
exponent of the result is modified accordingly. The result is 

01.0000000000000000000000000000001 X 2° 

- 01.0000000000000000000000000000000 X 2° 


01.0000000000000000000000000000000 X 2-31 
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Example 5-8. Floating-Point Addition with a 32-Bit Shift 
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This example illustrates a situation where a full 32-blt shift is necessary to 
normalize the result. Let 

a = 01.1111111111111111111111111111111 X 2^27 
b = 10.0000000000000000000000000000000 X 2^27 

The operation to be performed is a + b. 

01 1111111111111111111111111111111 ^ 2^27 
+ 10.0000000000000000000000000000000 X 2 ^27 

11.1111111111111111111111111111111 X 2^27 

Normalizing the result requires a left-shift of 32 and a subtraction of 32 from 
the exponent. The result is 

01 1111111111111111111111111111111 ^ 2^27 
+ 10.0000000000000000000000000000000 X 2 ^27 

10.0000000000000000000000000000000 X 295 


Example 5-9. Floating-Point Addition/Subtraction and Zero 

When floating-point addition and subtraction is performed with a floating¬ 
point 0,, the following identities are satisfied: 

a ± 0 = a (a ^ 0) 

0 ± 0 = 0 
0 - a = -a (a # 0) 
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5.6 Normalization Using the NORM Instruction 

The NORM instruction takes an extended-precision floating-point number, 
assumed to be unnormalized, and normalizes it. Since the number is assumed 
to be unnormalized, no implied most-significant nonsign bit is assumed. The 
NORM instruction executes the following three steps: 

1) Locate the most-significant nonsign bit of the floating-point number. 

2) Left-shift to normalize the number. 

3) Adjust the exponent. 

Given the extended-precision floating-point value a to be normalized, the 
normalization (norm ()) is performed as shown in Figure 5-11. 
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a 



c = norm(a) 


Figure 5-11. Flowchart for NORM Instruction Operation 
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Example 5-10. NORM Instruction 

Assume that an extended-precision register contains the value 

man = 00000000000000000001000000000001, exp = 0 

When the normalization is performed on a number assumed to be unnormal¬ 
ized, the binary point is assumed to be 

man = 0.0000000000000000001000000000001, exp = 0 

This number is then sign-extended one bit so that the mantissa contains 33 
bits. 

man = 00.0000000000000000001000000000001, exp = 0 

The intermediate result after the most-significant nonsign bit is located and 
the shift performed is 

man = 01.0000000000010000000000000000000, exp = -19 
The final 32-bit value output after removing the redundant bit is 
man = 00000000000010000000000000000000, exp = -19 


The NORM instruction is useful for counting the number of leading zeros or 
leading ones in a 32-blt field. If the exponent is initially zero, the absolute 
value of the final value of the exponent Is the number of leading ones or zeros. 
This instruction Is also useful for manipulating unnormalized floating-point 
numbers. 
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5.7 Rounding: The RND Instruction 

The RND instruction rounds a number from the extended-precision floating¬ 
point format to the single-precision floating-point format. Rounding is similar 
to floating-point addition. Given the number a to be rounded, the following 
operation is performed first. 

c = a{man) x +(1x2 {a(exp)-24)) 

Next a conversion from extended-precision floating-point to single-precision 
floating-point format is performed. Given the extended-precision floating¬ 
point value, the rounding (rnd()) is performed as shown In Figure 5-12. 
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2 a(exp) -24 



c = rnd(8) 


5 


Figure 5-12. Flowchart for Floating-Point Rounding by the RIMD 

Instruction 
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5.8 Floating-Point to Integer Conversion 

Floating-point to integer conversion, using the FIX Instructions, allow ex¬ 
tended-precision floating-point numbers to be converted to single-precision 
integers in a single cycle. The floating-point to integer conversion of the value 
X will be referred to here as fix(x). The conversion will not overflow if a, the 
number to be converted. Is in the range: 

-231 ^ a s 231 - 1 


5 


First, It is necessary to be certain that 
a (exp) ^ 30 

If these bounds are not met, an overflow occurs. If an overflow occurs in the 
positive direction, the output is the most positive integer. If an overflow oc¬ 
curs in the negative direction, the output Is the most negative Integer. If 
a(exp) is within the valid range, then a{man), with implied bit included. Is 
sign-extended and right-shifted (rs) by the amount 

rs = 31 - a (exp) 

This right-shift (rs) shifts out those bits corresponding to the fractional part 
of the mantissa. For example: 

If 0 < X < 1, then flx(x) = 0. 

If -1 < X < 0, then fix(x) = -1. 

The flowchart for the floating-point to integer conversion Is shown in Figure 
5-13. 
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a 



c = fix(a) 


Figure 5-13. Flowchart for Floating-Point to Integer Conversion 
by FIX Instructions 
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Floating-Point Operations - Conversion to Floating-Point 


5.9 Integer to Floating-Point Conversion Using the FLOAT 
Instruction 

Integer to floating-point conversion, using the FLOAT instruction, allows sin¬ 
gle-precision integers to be converted to extended-precision floating-point 
numbers. The flowchart for this conversion is shown in Figure 5-14. 


a 



c == float (a) 


Figure 5-14. Flowchart for Integer to Floating-Point Conversion 
Using the FLOAT Instruction 
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Section 6 

Addressing 


The TMS320C30 supports five groups of powerful addressing modes. Six 
types of addressing may be used within the groups, which allow access of 
data from memory, registers, and the instruction word. This section details the 
operation, encoding, and Implementation of the addressing modes. Also dis¬ 
cussed is the management of system stacks, queues, and deques In memory. 
The major topics in this section are: 

• Types of Addressing (Section 6.1 on page 6-2) 

- Register 

- Direct 

- Indirect 

- Short-immediate 
~ Long-immediate 

- PC-relative 

• Groups of Addressing Modes (Section 6.2 on page 6-18) 

- General addressing modes 

- Three-operand addressing modes 
Parallel addressing modes 

- Long-Immediate addressing mode 

- Conditional-branch addressing modes 

• Circular Addressing (Section 6.3 on page 6-22) 

• Bit-Reversed Addressing (Section 6.4 on page 6-26) 

• System Stack Management (Section 6.5 on page 6-27) 
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6.1 Types of Addressing 

Six types of addressing allow access of data from memory, registers, and the 
instruction word. They are: 

• Register 

• Direct 

'«• 

• Indirect 

• Short-immediate 

• Long-immediate 

• PC-relative 

Some types of addressing are appropriate for some instructions and not others. 
For this reason, the types of addressing are used in the five different groups 
of addressing modes as follows: 

• General addressing modes (G): 

- Register 

- Direct 

- Indirect 

- Short-immediate 

• Three-operand addressing modes (T): 

- Register 

- Indirect 

• Parallel addressing modes (P): 

- Register 

- Indirect 

• Long-immediate addressing mode 

- Long-immediate 

• Conditional-branch addressing modes (B): 

- Register 

~ PC-relative 

The six types of addressing will be discussed first, followed by the five groups 
of addressing modes. 

6.1.1 Register Addressing 

In register addressing, the operand is contained in a CPU register, as shown 
in the example below. 

ABSF R1 ; R1 = |R1! 

The syntax for the CPU registers, the assembler syntax, and the assigned 
function for those registers are listed in Table 6-1. 
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Table 6>1. CPU Register/Assembler Syntax and Function 


CPU REGISTER 

ASSEMBLER 

ASSIGNED 

ADDRESS 

SYNTAX 

FUNCTION 

OOh 

RO 

Extended-precision register 

01 h 

R1 

Extended-precision register 

02h 

R2 

Extended-precision register 

03h 

R3 

Extended-precision register 

04h 

R4 

Extended-precision register 

05h 

R5 

Extended-precision register 

06h 

R6 

Extended-precision register 

07h 

R7 

Extended-precision register 

08h 

ARO 

Auxiliary register 

09h 

AR1 

Auxiliary register 

OAh 

AR2 

Auxiliary register 

OBh 

AR3 

Auxiliary register 

OCh 

AR4 

Auxiliary register 

ODh 

AR5 

Auxiliary register 

OEh 

AR6 

Auxiliary register 

OFh 

AR7 

Auxiliary register 

lOh 

DP 

Data page pointer 

11h 

IRO 

Index register 0 

12h 

IR1 

Index register 1 

13h 

BK 

Block size 

14h 

SP 

Active stack pointer 

15h 

ST 

Status register 

16h 

IE 

CPU/DMA interrupt enable 

17h 

IF 

CPU interrupt flags 

18h 

lOF 

I/O flags 

19h 

RS 

Repeat start address 

lAh 

RE 

Repeat end address 

IBh 

RC 

Repeat counter 


6.1.2 Direct Addressing 

In direct addressing, the data address is formed by the concatenation of the 
eight least-significant bits of the data page pointer (DP) with the 16 least- 
significant bits of the instruction word (expr). This results In 256 pages (64 
K words per page), giving the programmer a large address space without 
needing to change the page pointer. The syntax and operation for direct ad¬ 
dressing are listed below. 

Syntax: @expr 

Operation: address = DP concatenated with expr 

Figure 6-1 shows the formation of the data address. Example 6-1 gives an 
instruction example with data before and after Instruction execution. 
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31 16 15 0 



Figure 6<1. Direct Addressing 
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Example 6-1. Direct Addressing 

ADDI @0BCDEh,R7 


Before Instruction: 

DP = 8Ah 
R7 = Oh 

Data at SABCDEh = 12345678h 


After Instruction: 

DP = 8Ah 

R7 = 12345678h 

Data at 8ABCDEh = 12345678h 


6.1.3 Indirect Addressing 

Indirect addressing Is used to specify the address of an operand In memory 
through the contents of an auxiliary register, optional displacements, and In¬ 
dex registers. Only the 24 least-significant bits of the auxiliary registers and 
Index registers are used In Indirect addressing. This arithmetic Is performed 
by the auxiliary register arithmetic units (ARAUs) on these lower 24 bits and 
is unsigned. The upper eight bits are unmodified. 

The flexibility of indirect addressing Is possible because the ARAUs on the 
TMS320C30 are used to modify auxiliary registers in parallel with operations 
within the main CPU. Indirect addressing Is specified by a five-bit field In the 
instruction word, referred to as the mod field. A displacement Is either an ex¬ 
plicit unsigned 8-bit integer contained in the instruction word or an implicit 
displacement of one. Two Index registers, IRQ and IRl, can also be used In 
Indirect addressing. In some cases, an addressing scheme using circular or 
bit reversed addressing is optional. The mechanism for generating addresses 
in circular addressing is discussed in Section 6.3, bit reversed in Section 6.4. 

Table 6-2 lists the various kinds of Indirect addressing, along with the value 
of the modification (mod) field, assembler syntax, operation, and function for 
each. The succeeding 18 examples show the operation for each kind of indi¬ 
rect addressing. 
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Table 6 - 2 . Indirect Addressing 


MOD FIELD 

SYNTAX 

OPERATION 

DESCRIPTION 

i INDIRECT ADDRESSING WITH DISPLACEMENT I 

00000 

*+ARn(disp) 

addr = ARn + disp 

With pre-displacement add 

00001 

*-ARn(disp) 

addr = ARn - disp 

With pre-displacement subtract 

00010 

* + +ARn(disp) 

addr = ARn + disp 

ARn = ARn + disp 

With pre-displacement add and modify 

00011 

*--ARn(disp) 

addr = ARn - disp 

ARn = ARn - disp 

With pre-displacement subtract and 
modify 

00100 

*ARn + + (disp) 

addr = ARn 

ARn = ARn + disp 

With post-displacement add and 
modify 

00101 

*ARn--(disp) 

addr = ARn 

ARn = ARn - disp 

With post-displacement subtract and 
modify 

00110 

*ARn + + (disp)% 

addr = ARn 

ARn = circ(ARn + disp) 

With post-displacement add and 
circular modify 

00111 

*ARn--(disp)% 

addr + ARn 

ARn = circ(ARn - disp) 

With post-displacement subtract and 
circular modify 

INDIRECT ADDRESSING WITH INDEX REGISTER IRO | 

01000 

*+ARn(IR0) 

addr = ARn + IRO 

With pre-index (IRO) add 

01001 

*-ARn(IR0) 

addr = ARn - IRO 

With pre-index (IRO) subtract 

01010 

* + +ARn(IR0) 

addr = ARn + IRO 

ARn = ARn + IRO 

With pre-index (IRO) add and modify 

01011 

*--ARn(IR0) 

addr = ARn - IRO 

ARn = ARn - IRO 

With pre-index (IRO) subtract and 
modify 

01100 

*ARn + -h(|R0) 

addr = ARn 

ARn = ARn + IRO 

With post-index (IRO) add and modify 

01101 

*ARn-(IR0) 

addr = ARn 

ARn = ARn - IRO 

With post-index (IRO) subtract and 
modify 

OHIO 

*ARn + + (IR0)% 

addr = ARn 

ARn = circ(ARn + IRO) 

With post-index (IRO) add and 
circular modify 

01111 

*ARn-(IR0)% 

addr = ARn 

ARn = circ(ARn - IRO) 

With post-index (IRO) subtract and 
circular modify 


LEGEND: 
addr 
ARn 
IRn 
disp 
+ + 

circ() 

% 

B 


= memory address 
= auxiliary register ARO - AR7 
= index register IRQ or IR1 
= displacement 
= add and modify 
= subtract and modify 
= address in circular addressing 
= where circular addressing is performed 
= where bit-reversed addressing is performed 
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Table 6-2. Indirect Addressing (Concluded) 


|mod field 

SYNTAX 

OPERATION 

DESCRIPTION 

I INDIRECT ADDRESSING WITH INDEX REGISTER IR1 | 

10000 

*+ARn(IR1) 

addr = ARn + IR1 

With pre-index (IR1) add 

10001 

*-ARn(IR1) 

addr = ARn - IR1 

With pre-index (IR1) subtract 

10010 

• + +ARn(IR1) 

addr = ARn + IR1 

ARn = ARn + IR1 

With pre-index (IR1) add and modify 

10011 

*-ARn(IR1) 

addr = ARn - IR1 

ARn = ARn - IR1 

With pre-index (IR1) subtract and 
modify 

10100 

•ARn + + (IR1) 

addr = ARn 

ARn = ARn + |R1 

With post-index (IR1) add and modify 

10101 

*ARn-(IR1) 

addr = ARn 

ARn = ARn - IR1 

With post-index (IR1) subtract and 
modify 

10110 

*ARn + + (IR1)% 

addr = ARn 

ARn = circ(ARn + IR1) 

With post-index (IR1) add and 
circular modify 

10111 

*ARn-(IR1)% 

EBHIHil 

With post-Index (IR1) subtract and 
circular modify 

1 INDIRECT ADDRESSING (SPECIAL CASES) | 

11000 

*ARn 

addr = ARn 

Indirect 

11001 

*ARn + + {IR0)B 

addr = ARn 

ARn = B(ARn + IRO) 

With post-index (IRO) add and 
bit-reversed modify 


LEGEND; 
addr 
ARn 
IRn 
disp 
+ + 


circO 

% 

B 


= memory address 
= auxiliary register ARO - ART 
= index register IRQ or IR1 
= displacement 
= add and modify 
= subtract and modify 
= address in circular addressing 
= where circular addressing is performed 
= where bit-reversed addressing is performed 




















































Addressing - Types of Addressing 

Example 6-2. Auxiliary Register Indirect 

The address of the operand to be fetched is contained in an auxiliary register (ARn). 
Operation: operand address = ARa? 

Assembler Syntax: *AHn 

Modification Field: 11000 
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Example 6-3. Indirect with Pre-Displacement Add 

The address of the operand to be fetched is the sum of an auxiliary register (ARn) and the 
displacement (disp). The displacement is either an eight-bit unsigned integer contained in 
the instruction word or an implied value of 1. 

Operation: operand address = AR/7+disp 

Assembler Syntax: *+ARA7(disp) 

Modification Field: 00000 



Example 6-4. Indirect with Pre-Displacement Subtract 

The address of the operand to be fetched is the contents of an auxiliary register (ARn) minus 
the displacement (disp). The displacement Is either an eight-bit unsigned Integer contained 
In the Instruction word or an implied value of 1. 

Operation: operand address = AR^T-disp 

Assembler Syntax: *-AR/7(disp) 

Modification Field: 00001 
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Example 6-5. Indirect with Pre-Displacement Add and Modify 


The address of the operand to be fetched is the sum of an auxiliary register (ARn) and the 
displacement (disp). The displacement Is either an eight-bit unsigned integer contained in 
the instruction word or an Implied value of 1. After the data Is fetched, the auxiliary register 
Is updated with the address generated. 


1 


Operation: 


operand address = ARn+disp 
ARA7=ARA7+disp 


Assembler Syntax: 


* + +AR/7(disp) 


Modification Field: 


00010 


31 24 23 0 



Example 6-6. Indirect with Pre-Displacement Subtract and Modify 


The address of the operand to be fetched is the contents of an auxiliary register (ARn) minus 
the displacement (disp). The displacement is either an eight-bit unsigned integer contained 
in the Instruction word or an implied value of 1. After the data is fetched, the auxiliary reg¬ 
ister is updated with the address generated. 

Operation: operand address = AR/7-disp 

ARr7=ARA7+disp 

Assembler Syntax: *--ARA7(disp) 

Modification Field: 00010 


disp 


31 24 23 0 
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Example 6-7. Indirect with Post-Displacement Add and Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is added to the auxiliary register. The dis¬ 
placement Is either an eight-bit unsigned integer contained In the instruction word or an 


implied value of 1 . 



Operation: 


operand address = ARa? 
AR/?=AR/7+disp 

Assembler Syntax: 


*ARA7+ + (disp) 

Modification Field: 


00100 

31 

24 23 



disp 


Example 6-8. Indirect with Post-Displacement Subtract and Modify 



The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand Is fetched, the displacement (disp) is subtracted from the auxiliary register. The 
displacement is either an eight-bit unsigned integer contained In the Instruction word or an 
Implied value of 1. 

Operation: operand address = AR/7 

ARn=AR/?-disp 

Assembler Syntax: *ARA7--(disp) 

Modification Field: 00101 
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Example 6-9. Indirect with Post-Displacement Add and Circular Modify 


The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is added to the contents of the auxiliary re¬ 
gister using circular addressing. This result is used to update the auxiliary register. The 
displacement is either an eight-bit unsigned integer contained in the instruction word or an 
implied value of 1. 

Operation: operand address = ARn 

ARn=circ(ARn+disp) 

Assembler Syntax: *ARA7+ + (disp)% 


Modification Field: 00110 

31 24 23 0 



Example 6-10. Indirect with Post-Displacement Subtract and Circular Modify 


The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) Is subtracted from the contents of the auxil¬ 
iary register using circular addressing. This result is used to update the auxiliary register. 
The displacement is either an eight-bit unsigned integer contained in the Instruction word 
or an implied value of 1. 

Operation: operand address = ARn 

ARn=circ(AR/?-disp) 

Assembler Syntax: *AR/7--(disp)% 


Modification Field: 00111 

31 24 23 0 
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Example 6-11. Indirect with Pre-Index Add 

The address of the operand to be fetched is the sum of an auxiliary register (ARn) and an 
index register (IRO or IR1). generated. 

Operation: operand address = AR/7+IRm 

Assembler Syntax: *+AR/7(IR/y7) 

Modification Field: 01000 if m=0 


10000 ifm=1 

31 24 23 * 



Example 6-12. Indirect with Pre-Index Subtract 

The address of the operand to be fetched is the difference of an auxiliary register (ARn) and 
an index register (IRO or 1R1). 

Operation: operand address = ARn-\Hm 

Assembler Syntax: *-ARa7(IR/77) 

Modification Field: 01001 if m=0 


10001 ifm=1 

31 24 23 0 
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Example 6-13. Indirect with Pre-Index Add and Modify 


The address of the operand to be fetched is the sum of an auxiliary register (ARn) and an 
index register (IRO or IR1). After the data is fetched, the auxiliary register is updated with 
the address generated. 

Operation: operand address = AR/ 7 +IR /77 

AR/? = AR/7 + IR/77 

Assembler syntax: * + +ARa?(IR/77) 


Modification Field: 


01010 ifm=0 

10010 ifm=1 



Example 6-14. Indirect with Pre-Index Subtract and Modify 


The address of the operand to be fetched is the difference of an auxiliary register (ARn) and 
an index register (IRO or IR1). The resulting address becomes the new contents of the 
auxiliary register. 

Operation: operand address = AR/ 7 -IR /77 

AR/7 = ARA7-IR/77 

Assembler Syntax: *--ARn{\Hm) 


Modification Field: 


01011 ifm=0 

10011 ifm=1 
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Example 6-15. Indirect with Post-Index Add and Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1) is added to the auxiliary register. 

Operation: operand address = ARn 

ARn=ARn+\Rm 

Assembler Syntax: *AR/7+ + (IR/77) 

Modification Field: 01100 if m=0 

10100 ifm=1 



Example 6-16. Indirect with Post-Index Subtract and Modify 


The address of the operand to be fetched Is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1) is subtracted from the auxiliary regis¬ 
ter. 

Operation: operand address = ARa? 

ARA7 = AR/7-IR/77 

Assembler Syntax: *ARa?--(IR/77) 


Modification Field: 


01101 ifm=0 

10101 ifm=1 
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Example 6-17. Indirect with Post-Index Add and Circular Modify 


The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1) is added to the auxiliary register. This 
value Is evaluated using circular addressing and replaces the contents of the auxiliary register. 

Operation: operand address = ARn 

ARn=c\rc{ARn + \Rm) 

Assembler Syntax: *ARA7+ + (IRm)% 

Modification Field: 01110 if m=0 

10110 ifm=1 

31 24 23 0 



Example 6-18. Indirect with Post-Index Subtract and Circular Modify 


The address of the operand to be fetched Is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or 1R1) Is subtracted from the auxiliary regis¬ 
ter. This value is evaluated using circular addressing and replaces the contents of the auxil¬ 
iary register. 


Operation: 
Assembler Syntax: 


operand address = AR/? 
AR/?=circ(ARA7-IR/77) 

*ARA7--(IRm)% 


Modification Field: 01111 if m=0 

10111 ifm=1 

31 24 23 0 
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Example 6-19. indirect with Post-Index Add and Bit-Reversed Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO) is added to the auxiliary register. This addi¬ 
tion is performed with a reverse-carry propagation and can be used to yield a bit-reversed 
(B) address. This value replaces the contents of the auxiliary register. 

Operation; operand address = ARn 

ARn=B{ARn+\RO) 

Assembler Syntax: *ARa7+ + (IR0)B 

Modification Field: 11001 

31 24 23 0 
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6.1.4 Short-Immediate Addressing 

In short-immediate addressing, the operand is a 16-bit immediate value con¬ 
tained in the 16 least-significant bits of the instruction word (expr). De¬ 
pending upon the data types assumed for the instruction, the short-immediate 
operand may be a two's-complement integer, an unsigned integer, or a float¬ 
ing-point number. The syntax for this mode is listed below. 

Syntax: expr 

Example 6-20 gives an instruction example with before and after instruction 
data. 

Example 6-20. Short-Immediate Addressing 

SUBI 1,R0 

Before instruction: After Instruction: 

RO = Oh RO = OFFFFFFFFh 
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6.1.5 Long-Immediate Addressing 

In long-immediate addressing, the operand is a 24-bit immediate value con¬ 
tained in the 24 least-significant bits of the instruction word (expr). The syn¬ 
tax for this mode is listed below. 

Syntax: expr 

Example 6-21 gives an Instruction example with before and after instruction 
data. 

Example 6-21. Long-Immediate Addressing 

BR SOOOh 

Before Instruction 
PC = Oh 


After Instruction: 
PC = SOOOh 


6.1.6 PC-Relative Addressing 

PC-relative addressing is used for branching. It replaces the value of the PC 
based upon the contents of the 16 least significant bit of the instruction word. 
The assembler takes the src (a label or address) specified by the user and 
generates a displacement. If the branch is a standard branch, this displace¬ 
ment Is equal to the label - (PC+1). If the branch is a delayed branch, this 
displacement Is equal to the label - (PC+3). 
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The displacement is stored as a 16-bit signed Integer in the least significant 
bits of the instruction word. 

Syntax: expr 


Example 6-22 gives an instruction example with before and after instruction 
data. 


Example 6-22. PC-Relative Addressing 


BU NEWPC 


pc=l, NEWPC=5, displacement=3 


Before Instruction: 

PC = 1h 


After Instruction: 

PC = 5h 
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6.2 Groups of Addressing Modes 

The types of addressing are used to form the following five groups of ad¬ 
dressing modes: 

• General addressing modes (G) 

• Three-operand addressing modes (T) 

• Parallel addressing modes (P) 

• Long-immediate addressing mode 

• Conditional-branch addressing modes (B) 

These groups of addressing modes are discussed In the following sections. 

6.2.1 General Addressing Modes 

Instructions that use the general addressing modes are general-purpose in¬ 
structions, such as ADDI, MPYF, and LSH. Such instructions are usually of 
the form: 

dst operation src -► dst 

where the destination operand is signified by dst, the source operand by src, 
and 'operation' defines an operation to be performed using the general ad¬ 
dressing modes to specify certain operands. Bits 31 -29 are zero, indicating 
general addressing mode instructions. Bits 22 and 21 specify the general ad¬ 
dressing mode (G) field, which defines how bits 15 through 0 are to be in¬ 
terpreted for addressing the src operand. 

Options for bits 22 and 21 (G field) are as follows: 

0 0 register (all CPU registers unless specified otherwise) 

01 direct 
1 0 indirect 
1 1 immediate 

If the src and £/5f fields contain register specifications, the value In these fields 
contains the CPU register addresses as defined by Table 6-1. For the general 
addressing modes, the following values of ARn are valid: 

ARn, 0 n :< 7 

Figure 6-2 shows the encoding for the general addressing modes. The nota¬ 
tion 'modn' Indicates the modification field that goes with the ARn field. Refer 
to Table 6-2 for further information. 


31 2928 23 22 21 20 1615 11 10 87 54 0 


0 0 0 

operation 

EKI 

dst 

U 

(0 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

0 0 0 

operation 

in 

dst 

direct 

0 0 0 

operation 

m 

dst 

modn 1 ARn | disp 

0 0 0 

operation 

on 

dst 

immediate 


I G I Destination I Source Operands I 


Figure 6-2. Encoding for General Addressing Modes 
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6.2.2 Three-Operand Addressing Modes 

Instructions that use the three-operand addressing modes, such as ADDIS, 
LSH3, CMPF3. or X0R3, are usually of the form: 

SRC1 operation SRC2 dst 

where the destination operand is signified by dst, the source operands by 
SRC1 and SRC2, and 'operation' defines an operation to be performed. Note 
that the '3' can be omitted from three-operand instructions. 

Bits 31-29 are set to the value of 001, indicating three-operand addressing 
mode Instructions. Bits 22 and 21 specify the three-operand addressing mode 
(T) field, which defines how bits 15-0 are to be interpreted for addressing the 
SRC operands. Bits 15-8 are used to define the SRC1 address, and bits 7-0 
to define the SRC2 address. Options for bits 22 and 21 (T) are as follows; 


T 

SRC1 

SRC2 

0 0 

Register 

Register 

0 1 

Indirect 

Register 

1 0 

Register 

Indirect 

1 1 

Indirect 

Indirect 


Figure 6-3 shows the encoding for three-operand addressing. If the SRC1 
and SRC2 fields use the same auxiliary register, both addresses are correctly 
generated. However, only the value created by the SRC1 field is saved in the 
auxiliary register specified. The assembler issues a warning if this condition 
Is specified by the user. 

The following values of ARn and ARm are valid: 

ARn,0 ^ n < 7 
ARm,0 < m ^ 7 

The notation "modm" or "modn" indicates the modification field goes with the 
ARm or ARn field respectively. Refer to Table 6-2 for further information. 

In Indirect addressing of the three-operand addressing mode, displacements 
(If used) are allowed of 0 or 1, and the index registers (IRQ and IR1) can be 
used. The displacement of 1 Is implied and is not explicitly coded in the In¬ 
struction word. 


31 29 28 23 22 21 20 16 15 13 12 11 10 8 7 5 4 3 2 0 


0 

0 

1 

operation 

0 

0 

dst 


0 0 

srcl 

0 

0 0 

src2 

0 

0 

1 

operation 

0 

1 

dst 

modn 

ARn 

0 

0 0 

src2 

0 

0 

1 

operation 

1 

0 

dst 

0 

0 0 

srcl 

modn 

ARn 

0 

0 

1 

operation 

D 

D 

dst 

modn 

ARn 

modm 

ARm 


I T I I SRC1 I SRC2 I 


Figure 6-3. Encoding for Three-Operand Addressing Modes 
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6.2.3 Parallel Addressing Modes 

Instructions that use parallel addressing (indicated by || (two vertical bars)) 
allow for the greatest amount of parallelism possible. The destination oper¬ 
ands are indicated as d1 and d2, signifying dst^ and dstl, respectively (see 
Figure 6-4). The source operands, signified by srd and src2, use the ex¬ 
tended-precision registers. The parallel operation to be performed is notated 
as 'operation'. 


31 30 29 26 25 24 23 22 21 19 18 16 15 11 10 8 7 3 2 0 


1 0 1 operation 


ED 

ED 

srcl 

src2 

modn 

ARn 

modm 

ARm 


srcz I src^ I 


Figure 6-4. Encoding for Parallel Addressing Modes 
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The parallel addressing mode (P) field specifies how the operands are to be 
used, i.e., whether they are source or destination. The specific relationship 
between the P field and the operands is detailed in the description of the In¬ 
dividual parallel instructions (see Section 11). However, the operands are al¬ 
ways encoded in the same way. Bits 31 and 30 are set to the value of 10, 
Indicating parallel addressing mode instructions. Bits 25 and 24 specify the 
parallel addressing mode (P) field, which defines how bits 21 -0 are to be in¬ 
terpreted for addressing the src operands. Bits 21 -1 9 are used to define the 
src^ address, bits 18-16 to define the srcl address, bits 15-8 the srcZ address, 
and bits 7-0 the srcA address. The notation 'modn' and 'modm' indicate 
which modification field goes with which ARn or ARm (auxiliary register) 
field, respectively. The parallel addressing operands are listed below. 

src1 0 < src1 ^ 7 (extended-precision registers R0-R7) 

src2 0 < src2 < 1 (extended-precision registers R0-R7) 

d1 If 0, dstl is RO. If 1, dsf/ is R1. 

d2 If 0, dst2 is R2. If 1, dst2 is R3. 

P 0 < P < 3 

src3 indirect (disp = 0, 1, IRO, IR1) 
src4 indirect (disp = 0, 1, IRO, IR1) 

As In the three-operand addressing mode. Indirect addressing In the parallel 
addressing mode allows for displacements of 0 or 1 and the use of the index 
registers (IRO and IR1). The displacement of 1 Is implied and is not explicitly 
coded in the instruction word. 

In the encoding shown for this mode in Figure 6-4, If the src2 and srcA fields 
use the same auxiliary register, both addresses are correctly generated, but 
only the value created by the src3 field is saved In the auxiliary register speci¬ 
fied. The assembler issues a warning if this condition Is specified by the user. 
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6.2.4 Long-Immediate Addressing Mode 

The long-immediate addressing mode is used to encode the program control 
instructions (BR, BRD, and CALL), for which It Is useful to have a 24-bit ab¬ 
solute address contained in the instruction word. The unconditional branches, 
BR (standard) and BRD (delayed), use the long-immediate addressing mode. 
Bits 31-25 are set to the value of 0110000, indicating long-immediate ad¬ 
dressing mode instructions. Selection of bit 24 determines the type of branch: 
D = 0 for a standard branch or D = 1 for a delayed branch. The long-immed¬ 
iate operand is the 24-bit src. These instructions are encoded as shown in 
Figure 6-5. 

31_ 25 24 23 _0 

I 0 1 1 0 0 0 0 I D I src | 

Figure 6-5. Encoding for Long-Immediate Addressing Mode 


6.2.5 


Conditional-Branch Addressing Modes 

Instructions using the conditional-branch addressing modes (^cond, 
EcondD, CfKLLcond, DBco/?d, and DBcondD) can perform a variety of con¬ 
ditional operations. Bits 31 -27 are set to the value of 01101, indicating con¬ 
ditional-branch addressing mode Instructions. Bit 26 Is set to 0 or 1, the 
former selects DEcond, the latter Bcond. Selection of bit 25 determines the 
conditional-branch addressing mode (B). If B = 0, register addressing is 
used; if B = 1, PC-relative addressing is used. Selection of bit 21 sets the type 
of branch: D = 0 for a standard branch or D = 1 for a delayed branch. The 
condition field {cond) specifies the condition checked to determine what ac¬ 
tion to take, i.e., whether or not to branch (see Section 11 for a list of condi¬ 
tion codes). Figure 6-6 shows the encoding for conditional-branch 
addressing. 
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Figure 6-6. Encoding for Conditional-Branch Addressing Modes 
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6.3 Circular Addressing 

Many algorithms, such as convolution and correlation, require the implemen¬ 
tation of a circular buffer in memory. In convolution and correlation, the cir¬ 
cular buffer is used to Implement a sliding window which contains the most 
recent data to be processed. As new data Is brought in, the new data over¬ 
writes the oldest data. Key to the Implemention of a circular buffer Is the Im¬ 
plementation of a circular addressing mode. This section describes the circular 
addressing mode of the TMS320C30. 

The blocksize register (BK) specifies the size of the circular buffer. Informa¬ 
tion concerning the lower 16 bits of the BK register plus a user-selected aux¬ 
iliary register (ARn) are used to specify the bottom of the circular buffer. The 
information concerning the BK register Is the location of the the first 1 bit, 
counting from the most-significant bit to the least-significant bit. In the lower 
16 bits. With the location of the first 1 bit specified as bit N, the address at 
the top of the buffer is referred to as the effective base (EB) and Is equal to 
bits 31 through (N+1) of ARn with bits N through 0 of EB being zero. 
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Figure 6-7 Illustrates the relationships between the blocksize register (BK), the 
auxiliary registers (ARn), the bottom of the circular buffer, the top of the cir¬ 
cular buffer, and the index into the circular buffer. 
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FIRST 1 AT LOCATION N 



LEGEND: 

ARn = auxiliary register n L = low-order bits 

BK = blocksize register L' = new low-order bits 

EF = effective base LSB = least-significant bit 

H = high-order bits N = bit value 

Figure 6-7. Flowchart for Circular Addressing 


In circular addressing, 'index' refers to the N LSBs of the auxiliary register se¬ 
lected, and 'step' is the quantity being added to or subtracted from the auxil¬ 
iary register. The following two rules must be followed when using circular 
addressing; 

• The step used must be less than or equal to the blocksize. 

• The first time the circular queue is addressed, the auxiliary register must 
be pointing to an element In the circular queue. 
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The algorithm for circular addressing is as follows: 

If 0 2 $ index + step < BK: 

index = index + step. 

Else If Index + step ^ BK: 

Index = index + step - BK. 
Else if Index + step < 0: 

Index = index + step + BK. 


Figure 6-8 shows how the circular buffer Is implemented. It illustrates the re¬ 
lationship of the quantities generated and the elements in the circular buffer. 


Address 
31 N + 1 N 


Data 

Top of Circular Buffer 


Effective Base (EB) 


H...H 


0...0 




31 N+1 N 0 

Aux. Register (ARn) | 

H...H 

L..L 


31 N+1 N 


H...H LSBsofBK 


Element 0 


Element 1 


Element (N LSBs of ARn) 


Last Element 


Last Element + 1 


Figure 6-8. Circular Buffer Implementation 


Figure 6-9 provides an example that shows the operation of circular address¬ 
ing. Assuming that all ARs are four bits, let ARO = 0000,and BK = 0110 
(blocksize of 6). This example shows a sequence of modifications and the 
resulting value of ARO. It also shows how the pointer steps through the cir¬ 
cular queue, with a variety of step sizes (both incrementing and decrement- 
ing). 
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*ARO ; ARO = 0 (0th value) 
*AR0+-l-(5)% ; ARO = 5 (1st value) 
*AR0++(2)% ; ARO = 1 (2nd value) 
*ARO—(3)% ; ARO = 4 (3rd value) 
*AR0++(6)% ; ARO = 4 (4th value) 
*ARO—% ; ARO = 3 (5th value) 

Value Data Address 



Figure 6-9. Circular Addressing Example 


Circular addressing is especially useful for the Implementation of FIR filters. 
Figure 6-10 shows one possible data structure for FIR filters. Note that the 
Initial value of ARO points to h(N-1), and the Initial value of AR1 points to 
x(0). Circular addressing is used in the TMS320C30 code for the FIR filter 
shown in Figure 6-11. 

Impulse Response Input Samples 



Figure 6-10. Data Structure for FIR Filters 
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* 

it 

Initialization 



LDI 

N,BK 

Load block size. 


LDI 

H,ARO 

Load pointer to impulse response. 


LDI 

X,AR1 

Load pointer to bottom of input 

* 

★ 



sample buffer. 

TOP LDF 

IN, R3 

Read input sample. 


STF 

R3,*ARl++% 

Store with other samples, 
and point to top of buffer. 


LDF 

0,R0 

Initialize RO. 


LDF 

0,R2 

Initialize R2. 

* 

if 

Filter 




RPTS 

N~1 

Repeat next instruction. 


MPYF3 

*ARO++%,*ARl++% 

,R0 

1 1 

ADDF3 

R0,R2,R2 i 

; Multiply and accumulate. 

* 

ADDF 

RO,R2 ] 

; Last product accumulated. 


STF 

R2,Y i 

; Save result. 


B 

TOP ; 

; Repeat. 


Figure 6-11. FIR Filter Code Using Circular Addressing 
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6.4 Bit-Reversed Addressing 

Bit-reversed addressing on the TMS320C30 is useful in FFT algorithms using 
a variety of radices. One auxiliary register is used as a pointer to the physical 
location of a data value. IRO Is used to specify the size of the FFT; e.g., the 
value contained in IRO must be equal to 2^ where n Is an integer. By adding 
IRO to the auxiliary register using bit-reversed addressing, addresses are gen¬ 
erated in a bit-reversed fashion. 


To Illustrate this kind of addressing, assume eight-bit auxiliary registers. Let 
AR2 contain the value 0110 0000 (96). This Is the base address of the data 
in memory. Let IRO contain the value 0000 1000 (8). Figure 6-12 shows a 
sequence of modifications of AR2 and the resulting values of AR2. 


*AR2 ; AR2 
*AR2++(IR0)B ; AR2 
*AR2++(IR0)B ; AR2 
*AR2++(IR0)B ; AR2 
*AR2+-I-(IR0)B ; AR2 
*AR2++(IR0)B ; AR2 
*AR2++(IR0)B ; AR2 
*AR2++(IR0)B ; AR2 


= 0110 0000 (0th value) 
= 0110 1000 (1st value) 
= 0110 0100 (2nd value) 
= 0110 1100 (3rd value) 
= 0110 0010 (4th value) 
= 0110 1010 (5th value) 
= 0110 0110 (6th value) 
= 0110 1110 (7th value) 


Figure 6-12. Bit-Reversed Addressing Example 
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Table 6-3 shows the relationship of the Index steps and the four LSBs of AR2. 
It can be seen that the four LSBs can be found by reversing the bit pattern of 
the steps. 

Table 6-3. Index Steps and Bit-Reversed Addressing 


STEP 

BIT PATTERN 

BIT-REVERSED PATTERN 

BIT-REVERSED STEP 

0 


0000 

0 

1 

0001 

1000 

8 


0010 

0100 

4 


0011 

1100 

12 



0010 

2 


0101 

1010 

10 


0110 

0110 

6 

H 

0111 

1110 

14 
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6.5 System and User Stack Management 

The TMS320C30 provides a dedicated system stack pointer (SP) for building 
stacks in memory. The auxiliary registers can also be used to build a variety 
of more general linear lists. This section discusses the implementation of the 
following types of linear lists: 

Stack A linear list for which all insertions and deletions are made at one 
end of the list. 

Queue A linear list for which all insertions are made at one end of the list 
and all deletions are made at the other end. 

Deque A 'double-ended queue' linear list for which Insertions and deletions 
are made at the either end of the list. 

The system stack pointer (SP) Is a 32-bit register that contains the address of 
the top of the system stack. The system stack fills from low-memory address 
to high-memory address (see Figure 6-13). The SP always points to the last 
element pushed onto the stack. A push performs a preincrement; and a pop, 
a postdecrement of the system stack pointer. 

The program counter is pushed on the system stack on subroutine calls, traps, 
and interrupts. It Is popped from the system stack on returns. The system stack 
can be pushed and popped using the PUSH, POP, PUSHF, and POPF in¬ 
structions. 



HIGH MEMORY 

Figure 6-13. System Stack Configuration 


6.5.1 Stacks 

Stacks can be built from low to high memory or high to low memory. Two 
cases for each type of stack are shown. Stacks can be built using the 
preincrement/decrement and postincrement/decrement modes of modifying 
the auxiliary registers (AR). Stack growth from high-to-low memory can be 
Implemented in two ways: 

CASE 1: Stores to memory using *--ARn to push data on the stack and reads 
from memory using *ARn + -i- to pop data off the stack. 

CASE 2: Stores to memory using *ARn-- to push data on the stack and reads 
from memory using * ++ARn to pop data off the stack. 
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Figure 6-14 illustrates these two cases. The only difference Is that in using 
case 1, the AR always points to the top of the stack, and in case 2, the AR 
always points to the next free location on the stack. 


ARn -► 


CASE 1 CASE 2 

LOW MEMORY LOW MEMORY 


(FREE) 

ARn 

(FREE) 

TOP OF STACK 


TOP OF STACK 




BOTTOM OF STACK 


BOTTOM OF STACK 


HIGH MEMORY HIGH MEMORY 

Figure 6-14. Implementions of High-to-Low Memory Stacks 


6 


Stack growth from low-to-high memory can be implemented in two ways: 

CASE 3: Stores to memory using * + +ARn to push data on the stack and 
reads from memory using *ARn-- to pop data off the stack. 

CASE 4: Stores to memory using *ARn + + to push data on the stack and 
reads from memory using *--ARn to pop data off the stack. 

Figure 6-15 shows these two cases. In the case 3, the AR always points to 
the top of the stack. In case 4, the AR always points to the next free location 
on the stack. 


CASE 3 CASE 4 

LOW MEMORY LOW MEMORY 


BOTTOM OF STACK 


BOTTOM OF STACK 




TOP OF STACK 


TOP OF STACK 

(FREE) 

ARn -► 

(FREE) 


HIGH MEMORY HIGH MEMORY 


Figure 6-15. Implementions of Low-to-High Memory Stacks 
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6.5.2 Queues and Deques 

The implementations of queues and deques is based upon the manipulation 
of the auxiliary registers for user stacks. For queues, two auxiliary registers 
are used, one to mark the front of the queue from which data is popped and 
the other to mark the rear of the queue where data is pushed. 

For deques, two auxiliary registers are also necessary. One Is used to mark 
one end of the deque, and the other is used to mark the other end. Data can 
be popped or pushed from either end. 


6 
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Section 7 


Program Flow Control 


The TMS320C30 provides a complete set of flexible and powerful constructs 
that allow for software control of the program flow. These consist of two main 
♦ types: repeat modes and branching (standard and delayed). When program¬ 
ming includes a combination of repeat modes, standard branches, and delayed 
branches, the type best suited for a particular application can be selected. 

Several interlocked operations instructions provide a flexible means of multi¬ 
processor support. Through the use of external signals, these Instructions al¬ 
low for powerful synchronization mechanisms. They also guarantee the 
Integrity of the communication and result in a high-speed operation. 

The TMS320C30 supports a nonmaskable external reset signal and a number 
of internal and external interrupts. These functions can be programmed for a 
particular application. 

Major topics discussed in this section Include: 

• Repeat Modes (Section 7.1 on page 7-2) 

- Initialization 

- Operation 

• Delayed Branches (Section 7.2 on page 7-7) 

• Interlocked Operations (Section 7.3 on page 7-8) 

• Reset Operation (Section 7.4 on page 7-12) 

• Interrupts (Section 7.5 on page 7-15) 
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7.1 Repeat Modes 

The repeat modes of the TMS320C30 allow for the implementation of zero- 
overhead looping. For many algorithms, there is an inner kernel of code where 
most of the execution time is spent. Using the repeat modes allows these 
time-critical sections of code to be executed in the shortest possible time. 

The TMS320C30 provides two instructions to support zero-overhead looping: 
RPTB (repeat a block of code) and RPTS (repeat a single Instruction). RPTB 
allows for a block of code to be repeated a specified number of times. RPTS 
allows a single instruction to be repeated a number of times and reduces the 
bus traffic by fetching the instruction only once. 

Three registers (RS, RE, and RC) are associated with the updating of the 
program counter when updated In a repeat mode. Table 7-1 describes these 
registers. 


Table 7-1. Repeat Mode Registers 


REGISTER 

FUNCTION 

RS 

Repeat Start Address Register. Holds the address of the first instruction 
of the block of code to be repeated. > 

RE 

Repeat End Address Register. Holds the address of the last instruction 
of the block of code to be repeated. 

RC 

Repeat Count Register. Contains one less than the number of times the 
block remains to be repeated. 


7.1.1 Repeat Mode Initialization 

There are two bits that are very important to the operation of RPTB and RPTS, 
the RM and S bits. 

The RM (repeat mode flag) bit In the status register specifies if the processor 
is running in the repeat mode. If RM = 0, fetches are not made in repeat 
mode. If RM = 1, fetches are made in repeat mode. 

The S bit Is hidden from the user, but is necessary to fully describe the oper¬ 
ation of RPTB and RPTS. If S = 0, the CPU is not performing fetches in the 
repeat-single mode. If S = 1 and RM = 1, the CPU is performing fetches in 
the repeat-single mode. 

The correct operation of the repeat modes requires that all of the above regis¬ 
ters and status register fields be initialized correctly. The RPTB and RPTS In¬ 
structions perform this initialization in slightly different ways (see Sections 
7.1.2 and 7.1.3). 
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7.1.2 RPTB Initialization 

When RPTB src is executed, the following operations take place: 

1) PC + 1 - RS 

2) src RE 

3) 1 RM status register bit 

4) 0 S bit 

Step 1 loads the start address of the block Into RS. Step 2 loads the src into 
the RE (end address of the block). The src operand Is a 24-blt value con¬ 
tained In the instruction word. Step 3 sets the status register to Indicate the 
repeat mode of operation. Step 4 indicates that this is the repeat block mode 
of operation. 

The last bit of information required is the number of times to repeat the block. 
The value is determined by properly initializing the RC (repeat count) register. 
Since the execution of RPTB does not load the RC, this register must be 
loaded explicitly by the user. The typical setup of the block repeat operation 
is shown below. 

LDI 15,RC ; 15 “► RC 

RPTB LOOP ; LOOP RE, PC + 1 -► RS, 1 RM, 0 S 

The repeat modes repeat a block of code at least once In a typical operation. 
The repeat counter should be loaded with one less than the number of times 
to repeat the block; i.e., a value of 0 in RC repeats the block of code one time. 
All block repeats initiated by RPTB can be interrupted. 

7.1.3 RPTS Initialization 

When RPTS src is executed, the following sequence of operations occurs: 

1) PC + 1 RS 

2) PC + 1 RE 

3) 1 RM status register bit 

4) 1 - S bit 

5) src -* RC 

The RPTS instuctlon loads all registers and mode bits necessary for the oper¬ 
ation of the single instruction repeat mode. Step 1 loads the start address of 
the block Into RS. Step 2 loads the end address into the RE (end address of 
the block). Since this is a repeat of a single instruction, the start address and 
the end address are the same. Step 3 sets the status register to indicate the 
repeat mode of operation. Step 4 indicates that this Is the repeat single-in¬ 
struction mode of operation. The operand src is loaded Into RC. 

Repeats of a single instruction initiated by RPTS are not interruptible, since 
the RPTS fetches the instruction word only once and then keeps it in the in¬ 
struction register for reuse. An interrupt would cause the instruction word to 
be lost. The refetching of the instruction word from the instruction register 
reduces memory accesses and, in effect, acts as a one-word program cache. 
If it is necessary to have a single instruction that is repeatable and interruptible, 
the RPTB instruction may be used on this single instruction. 
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7.1.4 Repeat Mode Operation 


The information in the repeat mode registers and associated control bits is 
used to control the modification of the PC when the fetches are being made 
in repeat mode. The repeat modes compare the contents of the RE register 
with the program counter (PC). If they match and the repeat counter Is non¬ 
negative, the repeat counter Is decremented, the PC Is loaded with the repeat 
start address, and the processing continues. The fetches and appropriate sta¬ 
tus bits are modified as necessary. Note that the repeat counter (RC) is never 
modified when RM Is 0. The maximum number of repeats occurs when RC = 
OSOOOOOOOh. This will result in 080000001 h repetitions. The detailed algo¬ 
rithm for the update of the PC is described in Figure 7-1. 

if RM == 1 

if S 


if first time through 

fetch instruction from memory 

else 

fetch instruction from IR 


If in repeat mode (RPTB or RPTS) 
If RPTS 

If this is the first fetch 
Fetch instruction from memory 
If not the first fetch 
Fetch instruction from IR 


RC - 1 RC 
if RC < 0 
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0 ST(RM) 
0 S 

PC + 1 PC 


; Decrement RC 
; If RC is negative 
; Repeat single mode completed 
; Turn off repeat mode bit 
; Clear S 
; Increment PC 


else if S = = 0 

fetch instruction from memory 
if PC == RE 

RC - 1 RC 
if RC > 0 
RS PC 
else if RC < 0 
0 ST(RM) 

0 -► S 

PC + 1 PC 


If RPTB 

Fetch instruction from memory 

If this is the end of the block 

Decrement RC 

If RC is not negative 

Set PC to start of block 

If RC is negative 

Turn off repeat mode bits 

Clear S 

Increment PC 


Figure 7-1. Repeat Mode Control Algorithm 


The RPTB and RPTS are four-cycle Instructions. These four cycles of over¬ 
head are only Incurred on the first pass through the loop. All subsequent 
passes through the loop are accomplished with zero cycles of overhead. In 
Example 7-1, the block of code from STLOOP to ENDLOP is repeated sixteen 
times. 

Example 7-1. RPTB Operation 

LD 15,RC ; Load repeat counter with 15 

RPTB ENDLOP ; Execute the block of code 

STLOOP ; from STLOOP to ENDLOP 16 times 


ENDLOP 
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Using this mode of modifying the PC allows for a straightforward analysis of 
what would happen in the case of branches within the block. It Is best to look 
at the operation from the point of view that the next value of the PC will be 
either PC + 1 or the contents of the RS register. It is thus apparent that this 
method of block repeat allows for any amount of branching within the re¬ 
peated block. Execution can go anywhere within the user's code via Interrupts, 
subroutine calls, etc. For proper modification of the loop counter, the last in¬ 
struction of the loop must be fetched. The repeating of the loop can be 
stopped prior to completion by writing a 0 Into the repeat counter or writing 
0 into the RM bit of the status register. 

Since the block repeat modes modify the program counter, other Instructions 
cannot modify the program counter at the same time. Two rules apply here: 

1) The last instruction in the block (or the only instruction In a block of size 
one) cannot be a Bcond, BR, D^cond, CALL, OALLcond, IRAPcond, 
RETIcond, RETScoA7d, IDLE, RPTB, or RPTS. Example 7-2 shows an 
incorrectly placed standard branch. 

2) None of the last four instructions from the bottom of the block (or the 
only instruction in a block of size one) can be a BcondD, BRD, or 
DBcondD. Example 7-3 shows an incorrectly placed delayed branch. 

If either of these rules are violated, the PC will be undefined. 

Example 7-2. Incorrectly Placed Standard Branch 

LD 15,RC ; 

RPTB ENDLOP ; 

STLOOP 

JCS 

ENDLOP BR OOPS ; 

Example 7-3. Incorrectly Placed Delayed 

LD 15,RC 

RPTB ENDLOP ; 

STLOOP ; 

GAF 

BRD OOPS ; 

ADDF 
MPYF 

ENDLOP SUBF 

Block repeats (RPTB) are nestable. Since all of the control is defined by the 
RS, RE, RC, and ST registers, saving and restoring these registers allows for 
their nesting. The RM bit in the status register can be used to determine if the 
block repeat mode is active. For example. If an Interrupt service routine is 
written which requires the use of RPTB, it Is possible that the interrupt asso¬ 
ciated with the routine may occur during another block repeat. The interrupt 
service routine can check the RM bit. If this bit is set, the Interrupt routine 
saves RS, RE, RC, and ST. The interrupt routine can then perform a block re¬ 
peat. Before returning to the interrupted routine, the Interrupt routine restores 


Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 


This branch violates rule 1 


Branch 

Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 


This branch violates rule 2 
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RS, RE, RC, and ST. If the RM bit is not set the save and restore of these re¬ 
gisters is not necessary. 
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7.2 Delayed Branches 

The branching capabilities of the TMS320C30 include two main types: stan¬ 
dard and delayed branches. Standard branches empty the pipeline before 
performing the branch to guarantee correct management of the program 
counter. This results in a TMS320C30 branch taking four cycles. Included in 
this class are calls, returns, and traps. 

Delayed branches on the TMS320C30 do not empty the pipeline, but rather 
guarantee that the next three instructions will be executed before the program 
counter is modified by the branch. The result is a branch that only requires a 
single cycle, thus making the speed of the delayed branch very close to the 
optimal block repeat modes of the TMS320C30. However, unlike block repeat 
modes, delayed branches may be used In situations other than looping. Every 
delayed branch has a standard branch counterpart that is used when a delayed 
branch cannot be used. The delayed branches of the TMS320C30 are 
BcondD, BRD, and DBcondD. 


Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. They do not depend 
upon the instructions following the delayed branch. Delayed branches are 
guaranteed to allow the three following instructions to be executed regardless 
of other pipeline conflicts. 


When a delayed branch is fetched, it remains pending until the three following 
instructions are executed. None of the three instructions that follow a delayed 
branch can be Bcond, BcondD, BR, BRD, DBcond, DBcondD, CALL, 
C/KLLcond, JRAPcond, REJlcond, RETScond, RPTB, RPTS, or IDLE, (see 
Example 7-4). 



Delayed branches disable interrupts until the three Instructions following the 
delayed branch are completed. This is independent of whether or not the 
branch is taken. 


If delayed branches are used incorrectly, the PC will be undefined. 

Example 7-4. Incorrectly Placed Delayed Branches 

Bl: BD LI 

NOP 
NOP 

B2: B L2 ; This branch is incorrectly placed 

NOP 
MJH 
NOP 
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7.3 Interlocked Operations 

One of the most common multiprocessing configurations is the sharing of 
global memory by multiple processors. In order to allow multiple processors 
to access this global memory and share data In a coherent manner, some sort 
of arbitration or handshaking Is necessary. This requirement for arbitration is 
the purpose of the TMS320C30 Interlocked operations. 

The TMS320C30 provides a flexible means of multiprocessor support with five 
Instructions, referred to as interlocked operations. Through the use of external 
signals, these Instructions allow for powerful synchronization mechanisms. 
They also guarantee the integrity of the communication and result in a high¬ 
speed operation. The interlocked-operation instruction group Is listed In Table 
7-2. 


Table 7-2. Interlocked Operations 


MNEMONIC 

DESCRIPTION 

OPERATION 

LDFI 

Load floating-point value into a register, 
interlocked 

Signal interlocked 
src dst 

LDII 

Load integer into a register. Interlocked 

Signal interlocked 
src dst 

SIGI 

Signal, interlocked 

Signal interlocked 
Clear interlock 

STFI 

Store floating-point value to memory, 
interlocked 

src dst 

Clear interlock 

STII 

Store integer to memory, interlocked 

src dst 

Clear interlock 


The interlocked operations use the two external flag pins, XFO and XF1. XFO 
must be configured as an output pin, and XF1 as an input pin. When config¬ 
ured in this manner, XFO signals an interlock operation request, and XFl acts 
as an acknowledge signal for the requested interlocked operation. In this 
mode, XFO and XF1 are treated as active-low signals. 

The external timing for the interlocked loads and stores are the same as stan¬ 
dard load and stores. The interlocked loads and stores m ay b e extend ed like 
standard accesses, by using the appropriate ready signal (RDY or XRDY). 


The LDFI and LDII instructions perform the following actions; 

1) Simultaneously set XFO to 0 and begin a read cycle. The timing of XFO 
Is similar to that of the address bus during a read cycle. 

2) Execute an LDF or LDI instruction and extend the read cycle until XFl 
is set to 0 and a ready (RDY or XRDY) is signalled. 

3) Leave XFO set to 0 and end the read cycle. 

The read/write operation is identical to any other read/write cycle except for 
the special use of XFO and XFl. The src operand for LDFI and LDII is always 
a direct or Indirect memory address. XFO is set to 0 only if the src is located 
off-chip; i.e., (STRB, MSTRB or lOSTRB is active), or the src is one of the on- 
chip peripherals. If on-chip memory is accessed, then XFO is not asserted, and 
the operation is as an LDF or LDI from internal memory. 
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The STFI and STM instructions perform the following operations: 

1) Simultaneously set XFO to 1 and begin a write cycle. The timing of XFO 
is similar to that of the address bus during a write cycle. 

2) Execute an STF or STI Instruction and extend the write cycle until a 
ready (RDY or XRDY) Is signalled. 

As In the case for LDFI and LDII, the dst of STFI and STM affects XFO if dst 
is located off-chip (STRB, MSTRB, or lOSTRB is active), or the src Is one of the 
on-chip peripherals. If on-chip memory is accessed, then XFO Is not asserted 
and the operations are as an STF or STI to Internal memory. 

The SIGI Instruction functions as follows: 

1) Sets XFO to 0. 

2) Idles until XF1 is set to 0. 

3) Sets XFO to 1 and ends the operation. 

While the LDFI, LDII, and SIGI Instructions are waiting for XF1 to be set to 
0, they may be interrupted. LDFI and LDII require a ready signal In order to 
be Interrupted. This allows the user to implement protection mechanisms 
against deadlock conditions by interrupting an interlocked load that has taken 
too long. Upon return from the interrupt, the next instruction is executed. The 
STFI and STil Instructions are not interruptible. 

Interlocked operations can be used to implement a busy-waiting loop, to ma¬ 
nipulate a multiprocessor counter, to Implement a simple semaphore mech¬ 
anism, or to perform synchronization between two TMS320C30s. The 
following examples illustrate the usefulness of the interlocked operations in¬ 
structions. Example 7-5 shows the Implementation of a busy-waiting loop. If 
location LOCK Is the interlock for a critical section of code, and a nonzero 
means the lock is busy, the algorithm for a busy-waiting loop can be used as 


shown In Example 7-5. 



Example 7-5. Busy-Waiting Loop 



LDI: 

1,R0 

@LOCK,Rl 

; Put 1 in RO 


LI: LDII 

; Interlocked 

operation begun 



; Contents of 

LOCK R1 

STII 

RO,@LOCK 

; Put RO (= 1) 

into LOCK, XFO 



; Interlocked 

operation ended 

BNZ 

LI 

; Keep trying 

until LOCK = 0 


Example 7-6 shows how a location COUNT may contain a count of the 
number of times a particular operation needs to be performed. This operation 
may be performed by any processor in the system. If the count is zero, the 
processor waits until It is nonzero before beginning processing. The algorithm 
for modifying COUNT correctly Is shown in Example 7-6. 





Program Flow Control - Interlocked Operations 


Example 7-6. Multiprocessor Counter Manipulation 


OR 

4,IOF 

LDII 

@C0UNT,R1 

BZ 

CT 

SUBI 

1,R1 

STII 

Rl,@COUNT 


XFO = 1 

Interlocked operation ended 
Interlocked operation begun 
Contents of COUNT -► R1 
If COUNT = 0, keep trying 
Decrement Rl(= COUNT) 

Update COUNT, XFO = 1 
Interlocked operation ended 


Figure 7-2 illustrates multiple TMS320C30s sharing global memory and using 
the interlocked instructions as in Examples 7-7, 7-8, 7-9, and 7-10. 



Figure 7-2. Multiple TMS320C30s Sharing Global Memory 


Sometimes it may be necessary for several processors to access some shared 
data or other common resources. The portion of code which must access the 
shared data is called a critical section. 

To ease the programming of critical sections, semaphores may be used. Se¬ 
maphores are variables which can only take non-negative Integer values. Two 
primitive. Indivisible, operations are defined on semaphores, namely (with S 
being a semaphore): 
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V(S) : S + 1 S 

P(S); P: if (S == 0), go to P 
else S ~ 1 S 


Indivisibility of V(S) and P(S) means that when these processes access and 
modify the semaphore S, they are the only processes accessing and modifying 
S. 


To enter a critical section, a P operation is performed on a common sema¬ 
phore, say S (S is initialized to 1). The first processor performing P(S) will 
be able to enter its critical section. All other processors are blocked since S 
has become 0. After leaving its critical section, the processor performs a V(S), 
thus allowing another processor to execute P(S) successfully. 

The TMS320C30 code for V(S) is shown in Example 7-7, and code for P(S) 
Is shown in Example 7-8. Compare the code in Example 7-8 to the code In 
Example 7-6. 


Example 7-7. Implementation of V(S) 


V: LDII @S,R0 

ADDI 1,R0 
STII R0,@S 


Interlocked read of S begins (XFO 
Contents of S RO 
Increment RO (= S) 

Update Sf end interlock (XFO = 0) 


Example 7-8. Implementation of P(S) 


OR 

4,IOF 

; End interlock 

(XFO = 1) 

LDII 

@S,R0 

; Interlocked read of S begins 



Contents 

of S 

-► RO 

BZ 

P 

If S = 0, 

go to P and try again 

SUBI 

1,R0 

Decrement 

RO 

(= S) 

STII 

R0,@S 

Update S, 

end 

interlock (XFO = 


0 ) 



The SIGI operation may be used to synchronize, at an Instruction level, multi¬ 
ple TMS320C30S. Consider two processors connected as shown in Figure 
7-3 The code for the two processors is shown In Example 7-9 


TMS320C30 #1 TMS320C30 #2 



Figure 7-3. Zero-Logic Interconnect of TMS320C30s 
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Processor #1 runs until it executes the SIGI. It then waits until processor #2 
executes a SIGI. At this point the two processors have synchronized and 
continue execution. 

Example 7-9. Code to Synchronize Two TMS320C30s at the Software Level 


Time Code for TMS320C30 #1 Code for TMS320C30 #2 

0 • • 

9 9 

9 9 

SIGI 9 


(WAIT) 


■ Synchronization Occurs 


ilGI 
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7.4 Reset Operation 

The TMS320C30 supports a nonmaskable external reset signal (RESET), which 
is used to perform system reset. This section discusses the reset operation. 

At powerup, the state of the TMS320C30 processor is undefined. The RESET 
signal Is used to place the processor in a known state. This signal must be 
asserted low for 10 or more HI clock cycles to guarantee a system reset. HI 
is an output clock signal generated by the TMS320C30 (see Appendix A for 
more Information). 

Reset affects the other pins on the device in either a synchronous or asyn¬ 
chronous manner. The synchronous reset is gated by the TMS320C30s in¬ 
ternal clocks. The asynchronous reset directly affects the pins, and Is faster 
than the synchronous reset. Table 7-3 shows the state of the TMS320C30s 
pins after RESET = 0. Each pin is described according to whether the pin Is 
reset synchronously or asynchronously. 

Table 7-3. Pin Operation at Reset 


SIGNAL 

# PINS 

OPERATION AT RESET 

1 PRIMARY INTERFACE (61 PINS) | 

D(31-0) 

32 

Synchronous reset. Placed in high-impedance state. 

A(23-0) 

24 

Synchronous reset. Placed in high-impedance state. 

R/W 

1 

Synchronous reset. Deasserted by going to a high level. 

STRB 

1 

Synchronous reset. Deasserted by going to a high level. 

r5Y 

1 

Reset has no effect. 

HOLD 

1 

Reset has no effect. 

HOLDA 

1 

Reset has no effect. 

1 EXPANSION INTERFACE (49 PINS) | 

XD(31-0) 

32 

Synchronous reset. Placed in high-impedance state. 

XA(12-0) 

13 

Synchronous reset. Placed in high-impedance state. 

XR/W 

1 

Synchronous reset. Deasserted by going to a high level. 

MSTRB 

1 

Synchronous reset. Deasserted by going to a high level. 

lOSTRB 

1 

Synchronous reset. Deasserted by going to a high level. 

XRDY 

1 

Reset has no effect. 

CONTROL SIGNALS (9 PINS) 

RESET 

1 

Reset input pin 


4 

Reset has no effect. 

IaCk 

1 

Synchronous reset. Deasserted by going to a high level. 

MC/W 

1 

Reset has no effect. 

XF(I-O) 

2 

Asynchronous reset. Placed In high-impedance state. 
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Table 7-3. Pin Operation at Reset (Continued) 


SIGNAL # PINS I OPERATION AT RESET 



SERIAL PORT 0 SIGNALS (6 PINS) 

CLKXO 

1 

Asynchronous reset Placed in high-impedance state. 

DXO 

1 

Asynchronous reset Placed In high-impedance state. 

FSXO 

1 

Asynchronous reset. Placed in high-impedance state. 

CLKRO 

1 

Asynchronous reset Placed In high-impedance state. 

DRO 

1 

Asynchronous reset Placed in high-impedance state. 

FSRO 

1 

Asynchronous reset. Placed In high-impedance state. 

I SERIAL PORT 1 SIGNALS (6 PINS) | 

CLKX1 

1 

Asynchronous reset. Placed In high-impedance state. 

DX1 

1 

Asynchronous reset. Placed in high-impedance state. 

FSX1 

1 

Asynchronous reset. Placed in high-Impedance state. 

CLKR1 

1 

Asynchronous reset. Placed in high-Impedance state. 

DR1 

1 

Asynchronous reset Placed in high-Impedance state. 

FSR1 

1 

Asynchronous reset Placed in high-impedance state. 

TIMER 0 SIGNAL (1 PIN) 

TCLKO 1 Asynchronous reset. Placed in high-impedance state. 

TIMER 1 SIGNAL (1 PIN) 

TCLK1 1 Asynchronous reset. Placed in high-impedance state. 

SUPPLY and OSCILLATOR SIGNALS (29 PINS) 

Vdd(3-0) 

4 

Reset has no effect. 

IODVdd(1.0 

2 

Reset has no effect. 

ADVdd(I.O) 

2 

Reset has no effect. 

PDVqd 

1 

Reset has no effect. 

DDVdd(1,0) 

2 

Reset has no effect. 

MDVdd 

1 

Reset has no effect. 

Vss(3-0) 

4 

Reset has no effect. 

DVss(3-0) 

4 

Reset has no effect. 

CVssd.O) 

2 

Reset has no effect. 

IVss 

1 

Reset has no effect. 

VbBP 

1 

Reset has no effect. 

SUBS 

1 

Reset has no effect. 

XI 

1 

Reset has no effect. 

X2/CLKIN 

1 

Reset has no effect. 

HI 

1 

Synchronous reset. Will go to Its initial state when RESET 
makes a 1 to 0 transition. See Appendix A. 

H3 

1 

Synchronous reset. Will go to its initial state when RESET 
makes a 1 to 0 transition. See Appendix A. 
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Table 7-3. Pin Operation at Reset (Concluded) 


SIGNAL 

# PINS 

OPERATION AT RESET 

EMULATION. TEST, and RESERVED (18 PINS) 

EMUO 

FI 4 

Undefined. 

EMU1 

El 5 

Undefined. 

EMU2 

FI 3 

Undefined. 

EMUS 

E14 

Undefined. 

EMU4 

FI 2 

Undefined. 

EMU5 

Cl 

Undefined. 


M6 

Undefined. 

RSVO 

J3 

Undefined. 

RSV1 

J4 

Undefined. 

RSV2 

K1 

Undefined. 

RSV3 

K2 

Undefined. 

RSV4 

LI 

Undefined. 

RSV5 

K3 

Undefined. 

RSV6 

L2 

Undefined. 

RSV7 

K4 

Undefined. 

RSV8 

Ml 

Undefined. 

RSV9 

L3 

Undefined. 

RSV10 

M2 

Undefined. 


At system reset, the following additional operations are performed: 


• The peripherals are reset This is a synchronous operation. The periph¬ 
eral reset is described in Section 9. 

• The following CPU registers are loaded with zero; 

- ST (CPU status register) 

- IE (CPU/DMA Interrupt enable flags) 

IF (CPU interrupt flags) 

lOF (I/O flags) 

• The reset vector Is read from memory location Oh and loaded Into the 
PC. This vector contains the start address of the system reset routine 

• Execution begins. Refer to Section 12 an example of a processor In¬ 
itialization routine. 

Multiple TMS320C30s driven by the same system clock may be reset and 
synchronized. When the 1 to 0 transition of RESET occurs, the processor is 
placed on a well-defined internal phase, and all of the TMS320C30s will come 
up on the same internal phase. 
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7.5 Interrupts 

The TMS320C30 supports multiple internal and external interrupts, which can 
be used for a variety of applications. This section discusses the operation of 
these interrupts. 

A functional diagram of the logic used to implement the external interrupt in¬ 
puts is shown in Figure 7-4; the logic for internal interrupts is similar. Addi¬ 
tional information regarding internal interrupts can be found in Section 9. 


Internal 



EINTn(DMA) 


Figure 7-4. Interrupt Logic Functional Diagram 


External interrupts are synchronized internally as illustrated by the three flip- 
flops clocked by H1 and H3. Once synchronized, the interrupt input will set 
the corresponding Interrupt Flag register (IF) bit if the interrupt is active. 

External interrupts can be effectively either edge- or level-triggered, depending 
on the duration of the low level on the Interrupt Input. An external Interrupt 
must be held low for at least one H1/H3 cycle to be recognized by the 
TMS320C30. If the Interrupt Is held low for between one and three cycles, 
then only one interrupt is recognized. If the interrupt Is held low three or more 
cycles, more than one interrupt may be recognized depending on how rapidly 
interrupts are serviced. 

When a particular interrupt is processed by the CPU or DMA controller, the 
corresponding interrupt flag bit is cleared by the i ntern al interrupt acknowl¬ 
edge signal. It should be noted, however, that if iNTn is still low when the 
interrupt acknowledge signal occurs, t he in terrupt flag bit will only be cleared 
for one cycle and then set again since INTn Is still low. Accordingly, it is the¬ 
oretically possible that , dep ending on when the IF register Is read, this bit may 
be zero even though INTn is zero. When the TMS320C30 is reset, zero is 
written to the interrupt flag register, thereby clearing all pending Interrupts. 
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The interrupt flag register bits may be read and written under software control. 
Writing a 1 to a IF register bit sets the associated interrupt flag to 1. Similiarly, 
writing a 0 resets the corresponding interrupt flag to 0. In this way, all inter¬ 
rupts may be triggered and/or cleared through software. Since the interrupts 
flags may be read, the interrupt pins may be polled in software when an in¬ 
terrupt-driven interface is not required. 

Internal interrupts operate in a similar manner. The bit in the IF register corre¬ 
sponding to an internal interrupt may be read and written through software. 
Writing a 1 sets the interrupt latch, and writing a 0 clears it. All Internal In¬ 
terrupts are one H1/H3 cycle in length. 

The CPU global interrupt enable bit (GIE), located in the CPU status register 
(ST), controls all CPU interrupts. All DMA interrupts are controlled by the 
DMA global interrupt enable bit, which is not dependent upon ST(GIE) and 
is local to the DMA. The DMA global interrupt enable bit is dependent, in 
part, upon the state of the DMA SYNCH bits. It Is not directly accessible 
through software (see Section 9). The AND of the Interrupt flag bit and the 
interrupt enables is then connected to the interrupt processor. 

To provide for maximum performance in servicing Interrupts, the interrupt ac¬ 
knowledge (lACK) instruction is provided. lACK drives the lACK pin and 
performs a dummy read. The read is peformed from the address specified by 
the lACK instruction operand. When lACK Is used, it typically Is placed In the 
early portion of an Interrupt service routine. For certain applications. It may 
be better suited at the end of the Interrupt service routine or be totally unnec¬ 
essary. 

The CPU controls all prioritization of interrupts (see Table 7-4 for reset and 
interrupt vector locations and priorities). If the DMA is not using interrupts 
for synchronization of transfers. It will not be affected by the processing of the 
CPU interrupts. If the CPU is involved in a pipeline conflict (branch, register, 
or memory), it will not respond to the interrupts until that conflict is resolved. 
It is therefore possible to interrupt the CPU and DMA simultaneously with the 
same or different interrupts and, in effect, synchronize their activities. For ex¬ 
ample, it may be necessary to cause a high-priority DMA transfer that avoids 
bus conflicts with the CPU, I.e., make the DMA higher priority than the CPU. 
This may be accomplished using an interrupt that causes the CPU to trap to 
an interrupt routine that contains an IDLE instruction. Then If the same in¬ 
terrupt Is used to synchronize DMA transfers, the DMA transfer counter can 
be used to generate an Interrupt and thus return control to the CPU following 
the DMA transfer. 

Since the DMA and CPU share the same set of Interrupt flags, the DMA may 
clear an interrupt flag before the CPU can respond to it. For example. If the 
CPU interrupts are disabled, the DMA can be responding to Interrupts and 
thus clearing the associated interrupt flags. 
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Table 7-4. Reset and Interrupt Vector Locations 


RESET OR 
INTERRUPT 

VECTOR 

LOCATION 

PRIORITY 

FUNCTION 

RESET 

Oh 

0 

External reset signal input on the RESET 
pin. 

Into 

1h 

1 

External interrupt input on the INTO pin. 

TnTT 

2h 

2 

External interrupt input on the INTI pin. 

TFTf? 

3h 

3 

External interrupt input on the INT2 pin. 

TnT3 

4h 

4 

External interrupt input on the INT3 pin. 

xinto 

5h 

5 

Internal interrupt generated when serial 
port 0 transmit buffer is empty. 

RINTO 

6h 

6 

Internal interrupt generated when serial 
port 0 receive buffer is full. 

XINT1 

7h 

7 

Internal interrupt generated when serial 
port 1 transmit buffer is empty. 

RINTl 

8h 

8 

Internal interrupt generated when serial 
port 1 receive buffer is full. 

TINTO 

9h 

9 

Internal interrupt generated by timer 0. 

TINT1 

OAh 

10 

Internal interrupt generated by timer 1. 

DINT 

OBh 

11 

Internal interrupt generated by DMA con¬ 
troller 0. 


If there is a delayed branch in the pipeline, interrupts are held pending until 
after the branch. If the interrupt occurs in the first cycle of the fetch of an in¬ 
struction, the fetched instruction is discarded (not executed), and the address 
of that instruction is pushed to the top of the system stack. If the interrupt 
occurs after the first cycle of the fetch. In the case of a multicycle fetch due to 
wait states, that instruction Is executed and the address of the next Instruction 
to be fetched is pushed to the top of the system stack. If no program fetch is 
occurring, then no new fetch is performed. After the address of the appropri¬ 
ate instruction has been pushed, the interrupt vector is fetched, loaded into the 
PC, and execution continues. 

The TMS320C30 allows the CPU and DMA to respond to and process inter¬ 
rupts in parallel. Figure 7-5 shows interrupt processing flow. The interrupts 
are polled and the CPU and DMA begin processing them. In the interrupt flow 
pertaining to the the CPU, the interrupt flag corresponding to the hlghest- 
priority enabled interrupt is cleared, and GIE is set to 0. The CPU completes 
all fetched instructions. The interrupt vector Is fetched and loaded Into the PC, 
and the CPU continues execution. The DMA cycle is similar to that for the 
CPU. After the pertinent interrupt flag is cleared, the DMA proceeds based 
upon the status of the SYNCH bits In the DMA global control register. 
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Figure 7-5. 


Interrupt Processing 
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Section 8 


External Bus Operation 


Two external interfaces are provided on the TMS320C30: the primary bus and 
the expansion bus. These are used to access memories and external peripheral 
devices. Software controlled wait states and an external input signal provide 
for wait state generation. 

Major topics discussed in this hardware interface section are listed below. 

• External Interface Control Registers (Section 8.1 on page 8-2) 

- Primary bus 
~ Expansion bus 

• External Interface Timing (Section 8.2 on page 8-5) 

• Programmable Wait States (Section 8.3 on page 8-18) 

• Programmable Bank Switching (Section 8.4 on page 8-20) 
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8.1 External Interface Control Registers 

The TMS320C30 provides two external Interfaces; the primary bus and the 
expansion bus. The primary bus consists of a 32-bit data bus, a 24-bit address 
bus, and a set of control signals. The expansion bus consists of a 32-bit data 
bus, a 13-bit address bus and a set of control signals. Both buses support 
software-controlled wait states and an external ready Input signal. Both buses 
support data, program, and I/O accesses. 

When a primary bus access is performed, STRB is low. The expansion bus 
supports two types of accesses. One Is used primarily for memory accesses 
that are signalled by MSTRB low. The timing for a MSTRB access Is the same 
as that of the STRB access on the parallel Interface. The other type of expan¬ 
sion bus access is commonly used for access of external peripheral devices 
and is signalled by lOSTRB low. 

The primary bus and the expansion bus each have an associated control reg¬ 
ister. These registers are memory-mapped as shown In Figure 8-1. 


REGISTER 

PERIPHERAL 

ADDRESS 

EXPANSION BUS CONTROL 

808060h 

RESERVED 

808061h 

RESERVED 

808062h 

RESERVED 

808063h 

PRIMARY BUS CONTROL 

808064h 

RESERVED 

808065h 

RESERVED 

808066h 

RESERVED 

808067h 

RESERVED 

808068h 

RESERVED 

808069h 

RESERVED 

80806Ah 

RESERVED 

80806Bh 

RESERVED ' 

80806Ch 

RESERVED 

80806Dh 

RESERVED 

80806Eh 

RESERVED 

80806Fh 


Figure 8-1. Memory-Mapped External Interface Control Registers 
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8.1.1 Primary Bus Control Register 

The primary bus control register is a 32-bit register that contains the control 
bits for the primary bus (see Figure 8-2). Table 8-1 lists the register bits with 
the bit names and functions. 


31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 




1031 

IQI 


IQI 

IQI 

33J 

Bll 

HQI 


mi 


XX 1 

XX 1 

15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 


H 


1 

0 


IQI 

Bl 

1 BNKCMP 

WTCNT 

SWW 


NOHOLDI 

holdstI 


R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R 

NOTE: XX = reserved bit, read as 0. 

R = read, W = write. 


Figure 8-2. Primary Bus Control Register 


Table 8-1. Primary Bus Control Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

HOLDST 

Hold status bit. This bit signals if the port is being held 
(HOLDST = 1) or is not being held (HOLDST = 0). This sta¬ 
tus bit is valid whether the port has been held via hardware or 
software. 

1 

NOHOLD 

Port hold signal. NOHOLD allows or disallows the port to be 
held by an external HOLD signal. When NOHOLD = 1, the 
TMS320C30 takes over the external bus and controls it re¬ 
gardless of requests by external devices. No hold acknowledge 
(HOLDA) is asserted when a H6LD is received. However, It 
is asserted if an internal hold is generated (HIZ = 1). NOHOLD 
is set to 0 at reset. 

2 

HIZ 

Internal hold. When set (HIZ = 1), the port is put In hold 
mode. This equivalent to the external HOLD signal. By forcing 
a three-state condition, the TMS320C30 can relinquish the 
external memory port through software. HOLDA goes low 
when the port is three-stated. HIZ is set to 0 at reset. 

■ 

SWW 

Software wait-state generation. In conjunction with WTCNT, 
this 2-bit field defines the mode of wait-state generation. It is 
set to 1 1 at reset. 

■ 

WTCNT 

Software wait mode. This 3-bit field specifies the number of 
cycles to use when in software wait mode for the generation 
of internal wait states. The range is zero (WTCNT = 0 0 0) to 
seven (WTCNT =111) H1/H3 cycles. It is set to 1 1 1 at 
reset. 

8-12 

BNKCMP 

Bank compare. This 5-bit field specifies the number of MSBs 
of the address to be used to define the bank size. It is set to 1 

0 0 0 0 at reset. 

13-31 

Reserved 

Read as 0. 
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8.1.2 Expansion Bus Control Register 


The expansion bus control register is a 32'bit register that contains control 
bits for the expansion bus (see Figure 8-3 and Table 8-2). 


31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 

I 


XX 1 

XX 1 

XX 1 

1 XX 1 

1 XX 1 

1 XX 1 

1 XX 

dT 

XX 

1 xx 1 

XX 1 

XX 1 

1 XX 1 

XX 1 

15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

I ^ 

IxxJ 

XX 1 

XX 1 

rrri 

1 XX 1 

1 XX 1 

1 XX 1 

d 

WTCNT 

d 

1 sww 1 

XX 1 

1 XX 1 

XX 1 


R/W R/W R/W R/W R/W 


NOTE; XX = reserved bit, read as 0. 
R = read, W = write. 


Figure 8-3. Expansion Bus Control Register 


Table 8-2. Expansion Bus Control Register Bits Summary 


BIT 

NAME 

FUNCTION 

0-2 

Reserved 

Read as 0. 

3-4 

SWW 

Software wait-state generation. In conjunction with the 
WTCNT, this 2-bit field defines the mode of wait-state gener¬ 
ation. It is set to 1 1 at reset. 

5-7 

WTCNT 

Software wait mode. This 3-bit field specifies the number of 
cycles to use when in software wait mode for the generation 
of internal wait states. The range is zero (WTCNT = 0 0 0) to 
seven (WTCNT =111) H1/H3 clock cycles. It is set to 1 1 

1 at reset. 

8-31 

Reserved 

Read as 0. 
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8.2 External Interface Timing 

This section discusses functional timing of operations on the primary bus and 
the expansion bus, the TIVIS320C30's two independent parallel buses. De¬ 
tailed timing specifications for all TMS320C30 signals are contained in Ap¬ 
pendix A, the TMS320C30 Data Sheet. 

The parallel buses implement three mutually exclusive address spaces distin- 
quished through the use of three separate control signals: STRB,MSTRB, and 
lOSTRB. The STRB signal controls accesses on the primary bus, and the MSTRB 
and lOSTRB control accesses on the expansion bus. Since the two buses are 
independent, two accesses may be made in parallel. 

With the exception of bank switching and the external HOLD function (dis¬ 
cussed later in this section), timing of primary bus cycles and MSTRB expan¬ 
sion bus cycles are identical, and will be discussed collectively. The acronym 
(M)STRB will be used In references which pertain equally to STRB and 
MSTRB. Similarly (X)R/W, (X)A, (X)D, and (X)RDY are used to symbolize the 
equivalent primary and expansion bus signals. The lOSTRB expansion bus 
cycles are timed differently and will be discussed independently. 


8.2.1 Primary Bus Cycles 

All bus cycles comprise integral numbers of H1 clock cycles. One HI cycle is 
defined to be from one falling edge of H1 to the next falling edge of HI. For 
full speed (zero wait state) accesses, reads take one HI cycle, while writes 
take two cycles, unless the write follows a read. In which case the write takes 
three cycles. Recall that internally (from the perspective of the CPU and 
DMA) writes require only one cycle If no accesses to that Interface are In 
progress. The following discussions pertain to zero wait state accesses unless 
otherwise specified. 

The (M)STRB signal Is low for the active portion of both reads and writes, 
which las ts one H1 cycle. Additionally before and after the active portion 
((M)STRB low) of writes only, there is a transition cycle of HI. During this 
transition cycle, the following occur: 

1 ) (M)STRB Is high. 

2) If required, (X)R/W changes state on HI rising. 

3) If required, addresses changes on HI rising If the previous H1 cycle was 
the active portion of a write. If the previous H1 cycle was a read, address 
changes on H1 falling. 

Figure 8-4 illustrates a read-read-write sequence for (M)STRB active and no 
wait states. The data is read as late in the cycle as possible to allow for the 
maximum access time from address valid. Note that although external writes 
take two cycles, internally (from the perspective of the CPU and DMA), they 
require only one cycle if no accesses to that interface are in progress. In the 
typical timing for ail external interfaces, the (X)R/W strobe does not change 
until (M)STRB or lOSTRB goes inactive. 
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Figure 8-5 llustrates a write-write-read sequence for (M)STRB active and no 
wait states. The address and data written are held valid approximately one- 
half cycle after (M)STRB changes. 
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Figure 8-6 illustrates a read cycle with one wait state. Since (X)RDY = 
1 , the read cycle is extended. ( M)STRB, (X)R/W, and (X)A are also ex¬ 
tended one cycle. The next time (X)RDY is sampled, It Is 0. 
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Figure 8-7 illustrates a write cycle with o ne wait s tate, ^ce initially 
(X)RDY = 1, the write cycle is extend ed. (M) STRB, (X)R/W, and (X)A 
are extended one cycle. The next time (X)RDY is sampled, it is 0. 



Figure 8-7. Use of Wait States for Write for (M)STRB = 0 
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8.2.2 Expansion Bus I/O Cycles 

In contrast to primary bus and MSTRB cycles, lOSTRB reads and writes are both 
two cycles In duration (with no wait states) and exhibit the same timing. 
During these cycles, address always changes on the falling edge of HI, and 
lOSTRB is low from the rising edge of the first HI cycle to the rising edge of 
the second H1 cycle. The lOSTRB signal always goes Inactive (high) between 
cycles, and XR/W is high for reads and low for writes. 

Figure 8-8 illustrates read and write cycles when lOSTRB is active and there 
are no wait states. For lOSTRB accesses, reads and writes require a rninimum 
of two cycles. Some off-chip peripherals may change their status bits when 
read or written. Therefore, it is important that valid addresses be maintained 
when communicating with these peripherals. For reads and writes when 
lOSTRB is active, lOSTRB is completely framed by the address. 
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-extra cycle- 


Figure 8-9. Read with One Wait-State for lOSTRB = 0 
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Figure 8-11. Memory Read and I/O Write for Expansion Bus 
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Figure 8-13. Memory Write and I/O Read for Expansion Bus 
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Figure 8-14 and Figure 8-15 illustrate the signal states when a bus is inactive 
(after a lOSTRB or (M)STRB access respectively). The strobes (STRB, MSTRB, 
lOSTRB) and (X)R/W go to 1. The address is undefined, and the ready signal 
(XRDY or RDY) is Ignored. 



Figure 8>14. Inactive Bus States for lOSTRB 
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Figure 8-15. Inactive Bus States for STRB and MSTRB 
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Figure 8-16 illustrates the timing for HOLD and HOLDA. HOLD is an external 
asynchronous Input. There is a mi nimum of one cycl e delay from when the 
processor recognizes HOLD=0 until HOLDA=0. When HOLDA=0, the address, 
data buses, and associated strobes are placed In a high-impedance state. All 
accesses occurring over an interface are completed before a hold is acknowl¬ 
edged. 



Figure 8-16. HOLD and HOLDA Timing 
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8.3 Programmable Wait States 

Both the parallel and expansion interfaces allow the control of wait-state 
generation through the manipulation of their associated memory-mapped 
control registers. The SWW field is used to select the mode of wait-state 
generation, and the WTCNT field is used to load an Internal timer used In the 
generation of wait states. The following four modes of wait-state generation 
can be used: 

• External RDY 

• WTCNT-generated RDYwtcnt 

• Logical-AND of RDY and R^wtcnt 

• Logical-OR of RDY and RDY^tcnt 

These four modes are u sed in the ge nerati on of the internal ready signal that 
controls accesses, RDY jpf As long as RDYjnt = 1/ the current external access 
Is delayed. When RDYjpt = 0, the current access completes. Since the use of 
programmable wait states for both external Interfaces is identical, only the 
primary bus Interface is described in the following paragraphs. 

RDYwtcnt is an internally generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT may be any 
value from 0 through 7. The counter is decremented every H1 /H3 clock cycle 
until It becomes 0. Once the counter is set to 0 , it remains set to 0 until the 
next acce ss. While the counter is nonzero, RDYwtcnt ^ • While the counter 
is 0, RDYyy/'tcnt ~ 

When SWW = 0 0, RDYjnt is only dependent upon RDY. RDYwtcnt is ignored. 
The truth table for this mode is shown In Table 8-3. 
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Table 8-3. Wait-State Generation When SWW = 00 


RDY 



0 

0 

0 

0 

1 

0 

1 

0 

1 

1 

1 

1 


When SWW = 0 1, RDYjnt is only dependent upon RDY^tcnt- is ignored. 
Table 8-4 shows the truth table for this mode. 

Table 8-4. Wait-State Generation When SWW = 01 


RDY 

IlsIsSffiSflll 


0 

0 

0 

0 

1 

1 

1 

0 

0 

1 

1 

1 


When SWW = 10, RDYjn t is the lo gical-OR (electrical-AND, since these sig¬ 
nals are low true) of RDY and RDY^tcnt (see Table 8-5). 

Table 8-5. Wait-State Generation When SWW = 10 


RDY 


■HaiM 

0 

0 

0 

0 

1 

0 

1 

0 

0 

1 

1 

1 


When SWW = 11, RDYj nt is the lo gical-AND (electrical-OR, since these sig¬ 
nals are low true) of RDY and RDY^tcnt The truth table for this mode is shown 
in Table 8-6. 

Table 8-6. Wait-State Generation When SWW = 11 


RDY 

RDY^tcnt 

RDVin, 

0 

0 

0 

0 

1 

1 

1 

0 

1 

1 

1 

1 
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8.4 Programmable Bank Switching 

Programmable bank switching provides the capability of switching between 
external memory banks without the need for externally inserting wait states 
due to memories requiring several cycles to turn off. Bank switching is im¬ 
plemented on the primary bus and not on the expansion bus. 

The size of a bank is determined by the number of bits specified to be exam¬ 
ined. For example (see Figure 8-17), if BNKCMP =16, the 16 MSBs of the 
address are used to define a bank. Since addresses are 24 bits, the bank size 
is specified by the 8 LSBs, yielding a bank size of 256 words in this case. If 
BNKCMP ^16, only the 16 MSBs are compared. Banksizes from 2® = 256 
to 2^4 = 16M are allowed. Table 8-7 summarizes the relationship between 
BNKCMP, the address bits used to define a bank, and the resulting bank size. 


■ 24-bit Address - 


23 


8 7 




Defines 


Number of bits to compare— 

bank size 

Figure 8-17. BNKCMP Example 



Table 8-7. BNKCMP and Bank Size 


BNKCMP 

MSBs DEFINING A BANK 

BANK SIZE (32-BIT WORDS) 

00000 

None 

224 = 16M 

00001 

23 

223 = 8M 

00010 

23-22 

222 = 4IVI 

00011 

23-21 

221 = 2M 

00100 

23-20 

220 = 1 VI 

00101 

23-19 

219 = 512K 

00110 

23-18 

218 = 256K 

00111 

23-17 

217 = 128K 

01000 

23-16 ' 

216 = 64K 

01001 

23-15 

216 = 32K 

01010 

23-14 

214 = i6K 

01011 

23-13 

213 = 8K 

01100 

23-12 

212 = 4K 

01101 

23-11 

211 = 2K 

OHIO 

23-10 

2l0 = IK 

01111 

23-9 

29 =512 

10000 

23-8 

28 = 256 

10001 

Reserved 

Undefined 

through 



11111 
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Internal to the TMS320C30 is a register that contains the MSBs (as defined 
by the BNKCMP field) of the last address used for a read or write over the 
primary interface. At reset the register bits are set to zero. If the MSBs of the 
address being used for the current primary interface read do not match those 
contained in this internal register, a read cycle is not asserted for one H1/H3 
clock cycle. During this extra clock cycle, the address bus switches over to the 
new address, but STRB is inactive (high). The contents of the internal register 
are replaced with the MSBs being used for the current read of the current ad¬ 
dress. If the MSBs of the address being used for the current read match the 
bits in the register, a normal read cycle takes place. 

If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When reading from a different memory bank, memory conflicts 
are avoided by the insertion of an extra cycle. This feature can be disabled 
by setting BNKCMP to 0. The Insertion of the extra cycle occurs only when a 
read is performed. The changing of the MSBs in the internal register occurs 
for ail reads and writes over the primary interface. 

Figure 8-18 illustrates the addition of an inactive cycle when switches be¬ 
tween banks of memory occur. 


8 


8-22 




Hardware Interface - Programmable Bank Switching 



|«i—extra cycle 



Figure 8-18. Bank Switching Example 
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Section 9 


Peripherals 


The TMS320C30 provides two timers, two serial ports, and an on-chip Direct 
Memory Access (DMA) controller. These peripheral modules are manipulated 
through memory-mapped registers located on the dedicated peripheral bus. 

The DMA controller is used to perform input/output operations without in¬ 
terfering with the operation of the CPU. Therefore, it is possible to interface 
the TMS320C30 to slow external memories and peripherals (A/D's, serial 
ports, etc.) without reducing the computational throughput of the CPU. The 
result is improved system performance and decreased system cost. 

Major topics discussed in this section on peripherals are listed below. 

• Timers (Section 9.1 on page 9-2) 

- Registers 

- Pulse generation 

- Operation modes 

• Serial Ports (Section 9.2 on page 9-9) 

- Registers 

- Operation configurations 
” Timing 

• DMA Controller (Section 9.3 on page 9-26) 

- Registers 

- VA memory transfer operation 

“ Synchronization of DMA channels 
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9.1 Timers 

The TMS320C30 timer modules are general-purpose 32-bit timer/event 
counters, with two signalling modes and internal or external clocking (see 
Figure 9-1). The timer modules can be used to signal to the TMS320C30 or 
the external world at specified intervals, or to count external events. With an 
internal clock, the timer can be used to signal an external A/D converter to 
start a conversion, or it can interrupt the TMS320C30 DMA controller to begin 
a data transfer. With an external input, the timer can count external events and 
Interrupt the CPU after a specified number of events. Available to each timer 
is an I/O pin that can be used either as an input clock to the timer, an output 
clock signal, or a general-purpose I/O pin. 



TIMER OUT 


Figure 9-1. Timer Block Diagram 


Three memory-mapped registers are used by each timer. They are: 

• Global control register 

• Period register 

• Counter register 

The global control register determines the operating mode of the timer, moni¬ 
tors the timer status, and controls the function of the I/O pin of the timer. The 
period register specifies the timer's signalling frequency. The counter register 
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contains the current value of the incrementing counter. The timer can be in¬ 
cremented on the rising edge or the falling edge of the input clock. The 
counter is zeroed whenever its value equals that in the period register. The 
pulse generator generates two types of external clock signals: pulse or clock. 
The memory map for the timer modules is shown in Figure 9-2. 


Register Peripheral Address 



Timer 0 

Timer 1 

TIMER GLOBAL CONTROL REGISTER 

808020h 

808030h 

RESERVED 

808021h 

808031h 

RESERVED 

808022h 

808032h 

RESERVED 

808023h 

808033h 

TIMER COUNTER REGISTER 

808024h 

808034h 

RESERVED 

808025h 

808035h 

RESERVED 

808026h 

808036h 

RESERVED 

808027h 

808037h 

TIMER PERIOD REGISTER 

808028h 

808038h 

RESERVED 

808029h 

808039h 

RESERVED 

80802Ah 

80803Ah 

RESERVED 

80802Bh 

80803Bh 

RESERVED 

80802Ch 

80803Ch 

RESERVED 

80802Dh 

80803Dh 

RESERVED 

80802Eh 

80803Eh 

RESERVED 

80802Fh 

80803Fh 


Figure 9-2. Memory-Mapped Timer Locations 
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9.1.1 Timer Global Control Register 

The timer global control register is a 32-bit register that contains the global 
and port control bits for the timer module. Table 9-1 defines the register bits, 
names, and functions. Bits 3-0 are the port control bits. Bits 11 -6 are the 
timer global control bits. Figure 9-3 shows the 32-bit register. Note that at 
reset, all bits are set to 0 except for DATIN (set to the value read on TCLK). 
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Table 9-1. Timer Global Control Register Bits Summary 


BITS 

NAME 

FUNCTION 

0 

FUNC 

FUNC controls the function of TCLK. If FUNC = 0, TCLK is con¬ 
figured as a general-purpose digital 1/0 port. If FUNC = 1, TCLK 
is configured as a timer pin (see Figure 9-6) for a description of the 
relationship between FUNC and CLKSRC. 

■ 

T/0 

If FUNC == 0 and CLKSRC = 0, TCLK is configured as a general- 
purpose I/O pin. In this case, if T/0 = 0, TCLK Is configured as a 
general-purpose input pin. If T/0 = 1, TCLK is configured as a 
general-purpose output pin. 

2 

DATOUT 

DATOUT drives TCLK when In I/O port mode. DATOUT can also 
be used as an input to the timer. 

3 

DATIN 

Data input on TCLK or DATOUT. A write has no effect. 

■a 

Reserved 

Read as 0. 

6 

GO 

The GO bit resets and starts the timer counter. When GO = 1 and 
the timer is not held, the counter is zeroed and begins incrementing 
on the next rising edge of the timer input clock. The GO bit is 
cleared on the same rising edge. GO = 0 has no effect on the timer. 

1 

HLD 

Counter hold signal. When this bit is zero, the counter is disabled 
and held In its current state. If the timer is driving TCLK, the state 
of TCLK Is also held. The Internal divide-by-two counter is also 
held so that the counter can continue where it left off when HLD 
is set to 1. The timer registers can be read and modified while the 
timer is being held. RESET has priority over HLD. Table 9-2 shows 
the effect of writing to GO and HLD. 

8 

C/P 

Clock/Pulse mode control. When C/P = 1, clock mode is chosen 
and the signalling of the status flag and external output will have a 
50 percent duty cycle. When C/P = 0, the status flag and external 
output will be active for one HI cycle during each timer period (see 
Figure 9-4). 

9 

CLKSRC 

Specifies the source of the timer clock. When CLKSRC = 1, an in¬ 
ternal clock with frequency equal to one-half the HI frequency is 
used to increment the counter. The INV bit has no effect on the 
internal clock source. When CLKSRC = 0, an external signal from 
the TCLK pin can be used to increment the counter. The external 
clock is synchronized Internally, thus allowing external asynchro¬ 
nous clock sources not exceeding the specified maximum allowable 
external clock frequency. This will be less than f(H1)/2. (See Fig¬ 
ure 9-6 for a description of the relationship between FUNC and 
CLKSRC). 

10 

INV 

Inverter control bit. If an external clock source is used and INV = 

1, the external clock is inverted as it goes into the counter. If the 
output of the pulse generator is routed to TCLK and INV = 1, the 
output is inverted before it goes to TCLK (see Figure 9-1). If INV 
= 0, no inversion Is performed on the input or output of the timer. 
The INV bit has no effect, regardless of its value, when TCLK is used 
in I/O port mode. 

11 

TSTAT 

This bit indicates the status of the timer. It tracks the output of the 
uninverted TCLK pin. This flag sets a CPU interrupt on a 
transition from 0 to 1. A write has no effect. 

12-31 

Reserved 

Read as 0. 
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31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 





XX 

Q| 

XX 

IQ 

XX 


IQ2I 


XX 

XX 

IQJI 


15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

d 

Q[ 

Qj 

IQ 

TSTAT 


CLKSRC 

Baa 

HLD 

13*1 



DATIN 

DATOUT 

BBl 

FUNC 





R/W 

R/W 

R/W 

R/W 

R/W 

R/W 



R 

R/W 

R/W 

R/W 


NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 


Figure 9-3. Timer Global Control Register 


Table 9-2 shows the result of a write using specified values of the GO and HLD 
bits in the global control register. 

Table 9-2. Result of a Write of Specified Values of GO and HLD 


GO 

HLD 

RESULT 

0 

0 

All timer operations are held. No reset is performed. 

0 

1 

Timer proceeds from state before write. 

1 

0 

All timer operations are held. Including zeroing of the counter. The GO 
bit is not cleared until the timer Is taken out of hold. 

1 

1 

Timer reset and started. 


9.1.2 Timer Period and Counter Registers 

The 32-bit timer period register is used to specify the frequency of the timer 
signalling. The timer counter register is a 32-bit register, which is reset to zero 
whenever it increments to the value of the period register. Both registers are 
set to 0 at reset. 

Certain boundary conditions affect timer operation, such as a zero in the pe¬ 
riod register and overflowing the counter. These conditions are listed as fol¬ 
lows: 

• When the period and counter Registers are zero, the operation of_the 
timer is dependent upon the C/P mode selected. In pulse mode (C/P = 
0), TSTAT is set and remains set. In clock mode (C/P = 1), the width 
of the cycle is 2/f(H1) and the external clocks are ignored. 

• When the counter register is not 0 and the period register = 0, the 
counter will count, roll over to 0, and then behave as described above. 

• When the counter register is set to a value greater than the period regis¬ 
ter, the counter may overflow when being Incremented. Once the 
counter reaches its maximum 32-blt value (OFFFFFFFFh), it simply 
clocks over to 0 and continues. 

Writes from the peripheral bus override register updates from the counter or 
new status updates to the control register. 
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9.1.3 Timer Pulse Generation 

The timer pulse generator (see Figure 9-1) can generate several different ex¬ 
ternal signals. These signals may be Inverted with the INV bit. The two basic 
modes are pulse mode and clock mode, as shown In Figure 9-4. In both 
modes, an internal clock source has a frequency of f(H1)/2, and an external 
clock source has a maximum frequency less than f(H1)/2. Refer to timer 
timing In Appendix A. In pulse mode (C/P = 0), the width of the pulse is 
1/f(H1). 


-fc4-2/f(H1) 






_r 

t- 




- 










! 


U-^1/f(CLKSRC) j 

1^—period register/f(CLKSRC)—^ 


(a) TSTAT AND TIMER OUTPUT (INV = 0) WHEN C/P = 0 (PULSE MODE) 
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-M-1/f(CLKSRC) 

■H-!—2/f(H1) 

I ' 



|i«- period register/fiCLKSRC) —*4 j 

1^-2 X period register/f(CLKSRC)-^ 


(b) TSTAT AND TIMER OUPUT (INV = 0) WHEN C/P = 1 (CLOCK MODE) 


Figure 9-4. Timer Timing 
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The rate of timer signaling is determined by the frequency of the timer input 
clock and the period register. The following equations are valid with either 
an internal or an external timer clock: 

f(pulse mode) = f(timer clock)/perlod register 
f(clock mode) = f(timer clock)/(2 x period register) 


9.1.4 Timer Operation Modes 

The timer can receive its input and send its output in several different modes, 
depending upon the setting of CLKSRC, FUNC, and I/O. The four timer 
modes of operation are defined as follows: 


• If CLKSRC = 1 and FUNC = 0, the timer input comes from the Internal 
clock. The internal clock is not affected by the INV bit. In this mode, 
TCLK Is connected to the I/O port control and can be used as a gener¬ 
al-purpose I/O pin (see Figure 9-5). If I/O = 0, TCLK Is configured as 
a general-purpose input pin whose state can be read In DATIN. DAT- 
OUT has no effect on TCLK or DATIN. If I/O = 1, TCLK is configured 
as a general-purpose output pin. DATOUT Is placed on TCLK and can 
be read in DATIN. 


INTERNAL 


EXTERNAL 


DATOUT(NC)-o 


TCLK 


DATIN 

I/O = 0 
(a) 


INTERNAL 


DATOUT- 


♦ 

DATIN 


EXTERNAL 

-TCLK 


I/O = 1 
(b) 

Figure 9-5. Timer I/O Port Configurations 



• If CLKSRC = 1 and FUNC = 1, the timer input comes from the Internal 
clock and the timer output goes to TCLK. This value may be inverted 
using INV, and the value output on TCLK can be read In DATIN. 

• If CLKSRC = 0 and FUNC = 0, the timer is driven according to the sta¬ 
tus of the I/O bit. If I/O = 0, the timer input comes from TCLK. This value 
can be inverted using INV, and the value of TCLK can be read in DATIN. 
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If T/0 = 1, TCLK is an output pin. Then TCLK and the tinner are both 
driven by DATOUT. All 0 to 1 transitions of DATOUT increment the 
counter. INV has no effect on DATOUT. The value of DATOUT can be 
read in DATIN. 

• If CLKSRC = 0 and FUNC = 1, TCLK drives the timer. If INV = 0, all 0 
to 1 transitions of TCLK increment the counter. If INV = 1, all 1 to 0 
transitions of TCLK Increment the counter. The value of TCLK can be 
read in DATIN. 

Figure 9-6 shows the four timer modes of operation. 


TIMER 


INTERNAL 


TSTAT 


I/O PORT 
CONTROL 


CLKSRC = 1 (INTERNAL) 
FUNC = 0 (I/O PIN) 

(a) 



NTERNAL 


TIMER IN 

—CLOCK 

TIMER IN 

TIMER OUT 

- I 

-[-►TCLK 

TIMER OUT 



1 j 



EXTERNAL 


TIMER 


INTERNAL 

INTERNAL 

"CLOCK 


TSTAT 


DATIN 


CLKSRC = 1 (INTERNAL) 
FUNC = 1 (TIMER PIN) 

(b) 


EXTERNAL 


TCLK 
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CLKSRC = 0 (EXTERNAL) CLKSRC = 0 (EXTERNAL) 

FUNC = 0 (I/O PIN) FUNC = 1 (TIMER PIN) 


(c) 


(d) 


Figure 9-6. Timer Modes as Defined by CLKSRC and FUNC 
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9.2 Serial Ports 

The two TMS320C30 serial ports are totally independent. Both serial ports 
are identical with a complementary set of control registers in each one. Each 
serial port can be configured to transfer 8, 16, 24, or 32 bits of data per word. 
The clock for each serial port can originate either internally or externally. An 
internally generated clock is a divide-down of the clockout frequency (H1). 
A continuous transfer mode is available which allows the serial port to transmit 
and receive any number of words without new synchronization pulses. 

Eight memory-mapped registers are provided for each serial port. They are: 

• Port global control register 

• Two port control registers for the six I/O pins 

• Three port receive/transmit timer registers 

• Data transmit register 

• Data receive register 

The global control register controls the global functions of the serial port and 
determines the serial port operating mode . Two port control registers control 
the functions of the six serial port pins. The transmit buffer contains the next 
complete word to be transmitted. The receive buffer contains the last com¬ 
plete word received. Three additional registers are associated with the 
transmit/receive sections of the serial port timer. A serial port block diagram 
is shown in Figure 9-7, and the memory map of a serial port is shown in Figure 
9-8. 
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-RECEIVE SECTION- 


RECEIVE 
TIMER (16) 


CLKR 
TSTAT 


CLKR 


RINT 


n 




RECEIVE 

CLOCK 


BIT COUNTER 
(8/16/24/32) 


RSR 

(32) 


DR DR 


FSX 

J- 


FSX 


LOAD 

CONTROL 


DRR 

(32) 


LOAD 



Figure 9-7. Serial Port Block Diagram 
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Register 

Peripheral Address 


Serial 

Serial 


Port 0 

Port 1 

PORT GLOBAL CONTROL REGISTER 

808040h 

808050h 

RESERVED 

808041h 

808051h 

FSX/DX/CLKX PORT CONTROL REGISTER 

808042h 

808052h 

FSR/DR/CLKR PORT CONTROL REGISTER 

808043h 

808053h 

R/X TIMER CONTROL REGISTER 

808044h 

808054h 

R/X TIMER COUNTER REGISTER 

808045h 

808055h 

R/X TIMER PERIOD REGISTER 

808046h 

808056h 

RESERVED 

808047h 

808057h 

DATA TRANSMIT REGISTER 

808048h 

808058h 

RESERVED 

808049h 

808059h 

RESERVED 

80804Ah 

80805Ah 

RESERVED 

80804Bh 

80805Bh 

DATA RECEIVE REGISTER 

80804Ch 

80805Ch 

RESERVED 

80804Dh 

80805Dh 

RESERVED 

80804Eh 

80805Eh 

RESERVED 

80804Fh 

80805Fh 


Figure 9-8. Memory-Mapped Locations for the Serial Port 


9.2.1 


Serial Port Global Control Register 

The serial port global control register is a 32-bit register that contains the 
global control bits for the serial port. Table 9-3 defines the register bits, bit 
names, and bit functions. The register is shown in Figure 9-9. 
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Table 9-3. Serial Port Global Control Register Bits Summary 

BIT NAME FUlMCTiON 

0 RRDY If RRDY = 1, the receive buffer has new data and is ready to be read. A three 

H1 /H3 cycle delay occurs from the reading of DRR to RRDY = 1. The rising edge 
of this signal sets RINT. If RRDY = 0, the receive buffer does not have new data 
_ since the last read. RRDY is set to 0 at reset and after the receive buffer is read. 

1 XRDY If XRDY = 1, the transmit buffer has written the last bit of data to the shifter and 

is ready for a new word. A three H1/H3 cycle delay occurs from the loading of 
the transmit shifter to XRDY being set to 1. The rising edge of this signal sets 
XINT. If XRDY = 0, the transmit buffer has not written the last bit of data to the 
transmit shifter and is not ready for a new word. XRDY is set to 1 at reset. 

2 FSXOUT This bit configures the FSX pin as an input (FSXOUT = 0) or an output (FSXOUT 

__ zH _ 

3 XSREMPTY If XSREMPTY = 0, the transmit shift register is empty. If XSREMPTY = 1, the 

transmit shift register is not empty. This bit is set to 0 at reset or by an XRESET. 

4 RSRFULL If RSRFULL = 1, an overrun of the receiver has occurred. In continuous mode, 

RSRFULL is set to 1 when both RSR and DRR are full. In noncontinuous mode, 
RSRFULL Is set to 1 when RSR and DRR are full and a new FSR is received. A 

read causes this bit to be set to 0. This bit can only be set to 0 by a system reset, 
a serial port receive reset (RRESET = 1), or a read. When the receiver tries to set 
RSRFULL to a 1 at the same time that the global register is read, the receiver will 
dominate and RSRFULL is set to 1. If RSRFULL = 0, no overrun of the receiver 
_ has occurred. _ 

5 HS If HS = 1, the handshake mode is enabled. If HS = 0, the handshake mode is 

disabled. 

’~6 XCLKSRCE If XCLKSRCE = 1, the internal transmit clock is used. If XCLKSRCE = 0, the 

_ external transmit clock is used. _ 

7 RCLKSRCE If RCLKSRCE = 1, the internal receive clock Is used. If RCLKSRCE = 0, the ex¬ 

ternal receive clock is used. 

8 XVAREN This bit specifies fixed (XVAREN = 0) or variable (XVAREN = 1) data rate sig¬ 

nalling when transmitting. With a fixed data rate, FSX Is active for at least one 
XCLK cycle, and then goes inactive before transmission begins. With variable 
data rate, FSX is active while all bits are being transmitted. When using an ex¬ 
ternal FSX and variable data rate signaling, the DX pin is driven by the transmitter 
_when FSX is held active or when a word Is being shifted out. 

9 RVAREN This bit specifies fixed (RVAREN = 0) or variable (RVAREN = 1) data rate sig¬ 

nalling when receiving. With a fixed data rate, FSR is active for at least one RCLK 
cycle, and then goes inactive before the reception begins. With variable data rate, 
_ FSR is active while all bits are being received. _ 

10 XFSM Transmit frame sync mode. Configures the port for continuous mode operation 

(XFSM = 1) or standard mode (XFSM = 0). In continuous mode, only the first 
word of a block generates a sync pulse, and the rest are simply transmitted con¬ 
tinuously to the end of the block. In standard mode, each word has an associated 
_ sync pulse. _ 

11 RFSM Receive frame sync mode. Configures the port for continuous mode (RFSM = 

1) or standard mode (RFSM = 0) operation. In continuous mode, only the first 
word of a block generates a sync pulse, and the rest are simply received contin¬ 
uously without expectation of another sync pulse. In standard mode, each word 
_ received has an associated sync pulse. _ 

12 CLKXP CLKX polarity. If CLKXP = 0, CLKX is active high. If CLKXP = 1, CLKX is active 

low. 
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Table 9-3. Serial Port Global Control Register Bits Summary (Concluded) 


BIT 

NAME 

FUNCTION 

13 

CLKRP 

CLKR polarity. If CLKRP = 0, CLKR is active high. If CLKRP = 1, CLKR is active 
low. 

14 

DXP 

DX polarity. If DXP = 0, DX is active high. If DXP = 1, DX is active low. 

15 

DRP 

DR polarity. If DRP = 0, DR is active high. If DRP = 1, DR is active low. 

16 

FSXP 

FSX polarity. If FSXP = 0, FSX is active high. If FSXP = 1, FSX Is active low. 

17 

FSRP 

FSR polarity. If FSRP = 0, FSR Is active high. If FSRP = 1, FSR Is active low. 

18-19 

XLEN 

This bit defines the word length of serial data transmitted. All data is assumed 
to be right-justified in the transmit buffer when fewer than 32 bits are specified. 

0 0 --- 8 bits 1 0 --- 24 bits 

0 1 -- 16 bits 1 1 —32 bits 

20-21 

RLEN 

This bit defines the word length of serial data received. All data is right-justified 
in the receive buffer. 

0 0 --- 8 bits 1 0 --- 24 bits 

0 1 -- 16 bits 1 1 --- 32 bits 

22 

XTINT 

Transmit timer interrupt enable. If XTINT = 0, the transmit timer Interrupt is dis¬ 
abled. If XTINT = 1, the transmit timer interrupt is enabled. 

23 

XI NT 

Transmit interrupt enable. If XINT = 0, the transmit interrupt Is disabled. If XINT 
= 1, the transmit interrupt is enabled. Note that the CPU transmit interrupt flag 
XINT is the logical OR of the enabled transmit timer interrupt and the enabled 
transmit interrupt. 

24 

RTINT 

Receive timer interrupt enable. If RTINT = 0, the receive timer interrupt is disa¬ 
bled. If RTINT = 1, the receive timer interrupt is enabled. 

25 

RINT 

Receive interrupt enable. If RINT = 0, the receive interrupt is disabled. If RINT 
= 1, the receive interrupt Is enabled. Note that the CPU receive interrupt flag 
RINT is the OR of the enabled receive timer interrupt and the enabled receive 
interrupt 

26 

XRESET 

Transmit reset. If XRESET = 0, the transmit side of the serial port is reset. To take 
the transmit side of the serial port out of reset, XRESET should be set to 1. 
However, XRESET should not be set to 1 until at least three cycles after XRESET 
goes inactive. This applies only to system reset. Setting XRESET to 0 does not 
change the contents of any of the serial port control registers. It places the 
transmitter in a state corresponding to the beginning of a frame of data. Resetting 
the transmitter generates a transmit interrupt. This bit should be set at the same 
time the mode of the transmitter is set. XFSM can be toggled without resetting 
the global control register. 

27 

PRESET 

Receive reset. If PRESET = 0, the receive side of the serial port is reset. To take 
the transmit side of the serial port out of reset, XRESET should be set to 1. Set¬ 
ting PRESET to 0 does not change the contents of any of the serial port control 
registers. It places the receiver in a state corresponding to the beginning of a 
frame of data. This bit should be set at the same time the mode of the receiver 
is set. RFSM can be toggled without resetting the global control register. 

28-31 

Reserved 

Read as 0. 
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31 

30 

29 

28 

27 


26 

25 


24 

23 

22 


21 

20 

19 18 

17 

16 



XX 

mm\ 

RRESET 

XRESET 


RTINT 

QI2Q 


RLEN 

XLEN 


1^33 





R/W 


R/W 

R/W 

R/W 

R/W 

R/W 


R/W R/W 

R/W R/W 

R/W 

R/W 

15 

14 

13 

12 

11 


10 

9 


8 

7 

6 


5 

4 

3 

2 

1 

0 

DRP 

DXP 

CLKRP 

CLKXP 

RFSM 

XFSM 

RVAREN 


RCLK 

SRCE 

XCLK 

SRCE 

jg 

RSR 

FULL 

XSR 

EMPTY 

FSXOUT 

XRDY 

RRDY 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 


R/W 

R/W 

R/W R/W R 

R 

R/W 

R 

R 


NOTE: XX = Reserved bit, read as 0. 

R = read, W = write. 

Figure 9-9. Serial Port Global Control Register 


9.2.2 FSX/DX/CLKX Port Control Register 

This 32-bit port control register controls the function of the serial port FSX, 
DX, and CLKX pins. At reset all bits are set to 0. Table 9-4 defines the register 
bits, bit names, and functions. Figure 9-10 shows this port control register. 

Table 9-4. FSX/DX/CLKX Port Control Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

CLKXFUNC 

CLKXFUNC controls the function of CLKX. If CLKXFUNC 
= 0, CLKX is configured as a general-purpose digital I/O 
port. If CLKXFUNC = 1, CLKX is a serial port pin. 

1 

CLKXT/0 

If CLKXT/0 = 0, CLKR is configured as a general-purpose 
input pin. If CLKXT/0 = 1, CLKX is configured as a gener¬ 
al-purpose output pin. 

2 

CLKXDATOUT 

Data output on CLKX. 

3 

CLKXDATIN 

Data input on CLKX. A write has no effect. 

4 

DXFUNC 

DXFUNC controls the function of DX. If DXFUNC = 0, DX 
is configured as a general-purpose digital I/O port. If 
DXFUNC = 1, DX is a serial port pin. 

5 

DXT/0 

If DXT/0 = 0, DX is configured as a general-purpose input 
pin. If DXT/0 = 1, DX is configured as a general-purpose 
output pin. 

6 

DXDATOUT 

Data output on DX. 

7 

DXDATIN 

Data input on DX. A write has no effect. 

8 

FSXFUNC 

FSXFUNC controls the function of FSX. If FSXFUNC = 0, 
FSX is configured as a general-purpose digital 1/0 port. If 
FSXFUNC = 1, FSX is a serial port pin. 

9 

FSXT/0 

If FSXT/0 = 0, FSX is configured as a general-purpose input 
pin. If FSXT/0 = 1, FSX Is configured as a general-purpose 
output pin. 

10 

FSXDATOUT 

Data output on FSX. 

11 

FSXDATIN 

Data input on FSX. A write has no effect. 

12-31 

Reserved 

Read as 0. 
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NOTE; XX = Reserved bit, read as 0. 
R = read, W = write. 


Figure 9-10. FSX/DX/CLKX Port Control Register 


9.2.3 FSR/DR/CLKR Port Control Register 

This 32-bit port control register is controlled by the function of the serial port 
FSR, DR, and CLKR pins. At reset all bits are set to 0. Table 9-5 defines the 
register bits, the bit names, and functions. Figure 9-11 illustrates this port 
control register. 

Table 9-5. FSR/DR/CLKR Port Control Register Bits Summary 


BIT 

NAME 

FUNCTION 

0 

CLKRFUNC 

CLKRFUNC controls the function of CLKR. If CLKRFUNC 
= 0, CLKR is configured as a general-purpose digital I/O 
port. If CLKRFUNC = 1, CLKR is a serial port pin. 

1 

CLKRT/0 

If CLKRT/0 = 0, CLKR is configured as a general-purpose 
input pin. If CLKRT/0 = 1, CLKR is configured as a gener¬ 
al-purpose output pin. 

2 

CLKRDATOUT 

Data output on CLKR. 

3 

CLKR DATIN 

Data input on CLKR. A write has no effect. 

4 

DRFUNC 

DRFUNC controls the function of DR. If DRFUNC = 0, DR 
is configured as a general-purpose digital I/O port. If 
DRFUNC = 1, DR is a serial port pin. 

5 

drT/o 

If DRT/0 = 0, DR is configured as a general-purpose input 
pin. If DRT/0 = 1, TCLK is configured as a general-purpose 
output pin. 

6 

DRDATOUT 

Data output on DR. 

7 

DRDATIN 

Data input on DR. A write has no-effect. 

8 

FSRFUNC 

FSRFUNC controls the function of FSR. If FSRFUNC = 0, 
FSR is configured as a general-purpose digital I/O port. If 
FSRFUNC = 1, FSR is a serial port pin. 

9 

FSRT/0 

If FSRT/0 = 0, FSR is configured as a general-purpose input 
pin. If FSRT/0 = 1, FSR is configured as a general-purpose 
output pin. 

10 

FSRDATOUT 

Data output on FSR. 

11 

FSR DATIN 

Data input on FSR. A write has no effect. 

12-31 

Reserved 

Read as 0. 
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NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 


Figure 9-11. FSR/DR/CLKR Port Control Register 


9.2.4 Receive/Transmit Timer Control Register 

A 32-bit receive/transmit timer control register contains the control bits for the 
timer module. At reset all bits are set to 0. Table 9-6 lists the register bits, bit 
names, and functions. Bits 5-0 control the transmitter timer. Bits 11-6 control 
the receiver timer. Figure 9-12 shows the register. 
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Table 9-6. Receive/Transmit Timer Control Register 


BIT 

NAME 

FUNCTION 

0 

XGO 

The XGO bit resets and starts the transmit timer counter. When 
XGO is set to 1 and the timer is not held, the counter is zeroed 
and begins incrementing on the next rising edge of the timer 
input clock. The XGO bit is cleared on the same rising edge. 
Writing 0 to XGO has no effect on the transmit timer. 

1 

XHLD 

Transmit counter hold signal. When this bit is set to 0, the 
counter is disabled and held in its current state. The internal 
divide-bv-two counter is also held so that the counter will 
continue where it left off when XHLD is set to 1. The timer 
registers may be read and modified while the timer is being 
held. RESET has priority over XHLD. 

2 

XC/P 

XCIock/Pulse mode control. When XC/P = 1, the clock mode 
is chosen. The signalling of the status fla£ and external output 
has a 50-percent duty cycle. When XC/P = 0, the status flag 
and external output are active for one CLKOUT cycle during 
each timer period. 

3 

XCLKSRC 

This bit specifies the source of the transmit timer clock. When 
XCLKSRC = 1, an internal clock with frequency equal to one- 
half the CLKOUT frequency is used to increment the counter. 
When XCLKSRC = 0, an external signal from the CLKX pin can 
be used to increment the counter. The external clock source is 
synchronized Internally, thus allowing for external asynchro¬ 
nous clock sources that do not exceed the specified maximum 
allowable external clock frequency, i.e., less than f(H1 )/2.6. 

4 

Reserved 

Read as zero. 

5 

XTSTAT 

This bit indicates the status of the receive timer. It tracks what 
would be the output of the uninverted CLKX pin. This flag sets 
a CPU interrupt on a transition from 0 to 1. A write has no 
effect. 

6 

RGO 

The RGO bit resets and starts the receive timer counter. When 
RGO is set to 1 and the timer is not held, the counter is zeroed 
and begins incrementing on the next rising edge of the timer 
input clock. The RGO bit Is cleared on the same rising edge. 
Writing 0 to RGO has no effect on the receive timer. 

7 

RHLD 

Receive counter hold signal. When this bit Is set to 0, the 
counter is disabled and held in its current state. The Internal 
divide-bv-two counter is also held so that the counter will 
continue where it left off when RHLD is set to 1. The timer 
registers may be read and modified while the timer is being 
held. RESET has priority over RHLD. 
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Table 9-6. Receive/Transmit Timer Control Register (Concluded) 


BIT 

NAME 

FUNCTION 

8 

RC/P 

RCIock/Pulse mode control. When RC/P = 1, the clock mode 
is chosen. The signalling of the status flag and external output 
has a 50-percent duty cycle. When RC/P = 0, the status flag 
and external output are active for one CLKOUT cycle during 
each timer period. 

9 

RCLKSRC 

This bit specifies the source of the receive timer clock. When 
RCLKSRC = 1, an Internal clock with frequency equal to one- 
half the CLKOUT frequency is used to increment the counter. 
When RCLKSRC = 0, an external signal from the CLKR pin can 
be used to increment the counter. The external clock source is 
synchronized internally, thus allowing for external asynchro¬ 
nous clock sources that do not exceed the specified maximum 
allowable external clock frequency, i.e., less than f(H1)/2.6. 

10 

Reserved 

Read as zero. 

11 

RTSTAT 

This bit indicates the status of the receive timer. It tracks what 
would be the output of the uninverted CLKR pin. This flag sets 
a CPU interrupt on a transition from 0 to 1. A write has no 
effect. 

12-31 

Reserved 

Read as 0. 
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NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 


Figure 9-12. Receive/Transmit Timer Control Register 


9.2.5 Receive/Transmit Timer Counter Register 

The timer counter register is a 32-bit register (see Figure 9-13). Bits 15-0 are 
the transmit timer counter, and bits 31-16 are the receive timer counter. Each 
counter is set to 0 whenever it increments to the value of the counter. It is also 
set to 0 at reset. 

31_16 

I RECEIVE COUNTER I 


15 _0 

_ TRANSMIT COUNTER _ 

NOTE; All bits are read/write. 

Figure 9-13. Receive/Transmit Timer Counter Register 
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9.2.6 Receive/Transmit Timer Period Register 

The timer period register is a 32-bit register (see Figure 9-14) Bits 15-0 are 
the timer transmit period, and bits 31 -16 are the receive period. Each register 
is used to specify the period of the timer. It is also set to 0 at reset. 

31_ 16 

r RECEIVE PERIOD | 


15 _0 

I TRANSMIT PERIOD | 

NOTE: All bits are read/write. 

Figure 9-14. Receive/Transmit Timer Period Register 


9.2.7 Data Transmit Register 

When the data transmit register (DXR) is loaded, the transmitter loads the 
word into the transmit shift register (XSR), and the bits are shifted out. The 
delay from a write to DXR until an FSX occurs (or can be accepted) Is two 
CLKX cycles. The word is not loaded into the shift register until the shifter is 
empty. When DXR is loaded into XSR, the XRDY bit is set, specifying that the 
buffer is available to receive the next word. Four tap points within the transmit 
shift register are used to transmit the word. These tap points correspond to 
the four data word sizes and are illustrated in Figure 9-15. The shift Is a left- 
shift (LSB to MSB) with the data shifted out of the MSB corresponding to the 
appropriate tap point. 


^ shift direction ^ 

31 24 23 16 15 8 7 0 
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32-bit 24-bit 16-bit 8-bit 

word tap word tap word tap word tap 

Figure 9-15. Transmit Buffer Shift Operation 


9.2.8 Data Receive Register 

When serial data is input, the receiver shifts the bits into the receive shift reg¬ 
ister (RSR). When the specified number of bits are shifted in, the data receive 
register (DRR) Is loaded from RSR and the RRDY status bit is set. The receiver 
is double-buffered. If the DRR has not been read and the RSR is full, the re¬ 
ceiver is frozen. New data coming into the DR pin Is Ignored. The receive 
shifter will not write over the DRR. The DRR must be read to allow new data 
in the RSR to be transferred to the DRR. When a write to DRR occurs at the 
same time that a RSR to DRR transfer takes place, the RSR to DRR transfer 
has priority. 

Data is shifted to the left (LSB to MSB). Figure 9-16 illustrates what happens 
when words less than 32 bits are shifted into the serial port. In this figure, it 
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is assumed that an 8-bit word is being received and that the upper three bytes 
of the receive buffer are originally undefined. In the first portion of the figure, 
byte a has been shifted in. When byte b is shifted in, byte a is shifted to the 
left. When the data receive register is read, both bytes a and b are read. 


After Byte a 
After Byte b 


^ shift direction 

31 24 23 16 15 8 7 0 


1 X ] 

1 ^ 1 

1 X 1 

1_?_1 


X j X 

a 

b 


Figure 9>16. Receive Buffer Shift Operation 


9.2.9 Serial Port Operation Configurations 

Several configurations are provided for the operation of the serial port clocks 
and timer. The clocks for each serial port can originate either internally or ex¬ 
ternally. Figure 9-17 shows serial port clocking in the I/O mode (FUNC = 0) 
when CLKX is either an input or an output. Figure 9-18 shows clocking in the 
serial port mode (FUNC = 1). Both figures use a transmit section for an ex¬ 
ample. The same relationship holds for a receive section. 


9 


9-20 




Peripherals - Serial Ports 


TSTAT 


INTERNAL 

INTERNAL 

TIMER IN h- CLOCK 


XSR 


DATOUT- 

DATIN- 


EXTERNAL 


INTERNAL 


TSTAT 


•CLKX 


TIMER IN 


XSR 


DATOUT- 
DATIN > 


EXTERNAL 


CLKX 


FUNC = 0 (I/O MODE) 

CLKXI/0 = 1 (CLKX, AN INPUT) 

XCLKSRC = 1 (INTERNAL CLK FOR TIMER) 
(a) 


FUNC = 0 (I/O MODE) 

CLKXI/0 = 1 (CLKX, AN OUTPUT) 
XCLKSRC = 0 (EXTERNAL CLK FOR TIMER) 

(b) 



FUNC 

CLKXI/0 

XCLKSRC 


0 (I/O MODE) 

1 (CLKX, AN INPUT) 

1 (INTERNAL CLK FOR TIMER) 

(0 


FUNC == 0 (I/O MODE) 

CLKXI/0 = 0 (CLKX, AN INPUT) 

XCLKSRC = 0 (EXTERNAL CLK FOR TIMER) 

(d) 


Figure 9-17. Serial Port Clocking in I/O Mode 
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Figure 9-18. Serial Port Clocking In Serial Port Mode 
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9.2.10 Serial Port Timing 

The formula for calculating the frequency of the serial port clock with an in¬ 
ternally generated clock Is dependent upon the operation mode of the serial 
port timers, defined as: 

f (pulse mode) = f (timer clock)/period register 
f (clock mode) = f (timer clock)/(2 x period register) 

An externally generated serial port clock (CLKX or CLKR) has a maximum 
frequency less than f(H1 )/2.6. See serial port timing In Appendix A. 

Transmit data is clocked out on the rising edge of the selected serial port clock. 
Receive data is latched into the receive shift register on the falling edge of the 
serial port clock. All data Is transmitted and loaded MSB first and right-justi¬ 
fied. If less than 32 bits are transferred, the data Is right-justified In the 32-bit 
transmit and receive buffers. Therefore, the LSBs of the transmit buffer are the 
bits that are transmitted. 

The transmit ready (XRDY) signal specifies that the data transmit register 
(DXR) is available to be loaded with new data. XRDY goes active as soon 
as the data is loaded Into the transmit shift register (XSR). The last word may 
still be shifting out when XRDY goes active. If DXR is loaded before the last 
word has completed transmission, the data bits transmitted will be consec¬ 
utive, i.e., the LSB of the first word immediately precedes the MSB of the 
second, with all signalling valid as in two separate transmits. XRDY goes in¬ 
active when DXR is loaded, and remains inactive until the data Is loaded Into 
the shifter. 

The receive ready (RRDY) signal is active as long as a new word of data Is 
loaded into the data receive register and has not been read. As soon as the 
data Is read, the RRDY bit Is turned off. 

When FSX Is specified as an output, the activity of the signal is determined 
solely by the internal state of the serial port. When a fixed data rate is speci¬ 
fied, FSX goes active when DXR is loaded into XSR to be transmitted out. 
One serial clock cycle later, FSX turns Inactive and data transmission begins. 
When a variable data rate is specified, the FSX pin Is activated when the data 
transmission begins, and remains active during the entire transmission of the 
word. Again, the data is transmitted one clock cycle after It Is loaded into the 
data transmit register. 

An Input FSX In the fixed data rate mode should go active for at least one se¬ 
rial clock cycle and then inactive to initiate the data transfer. The transmitter 
then transmits the number of bits specified by the LEN bits. In the variable 
data rate mode, the transmitter begins transmitting as soon as FSX goes active 
until the number of specified bits has been shifted out. In the variable data 
rate mode, when the FSX status changes prior to all the data bits being shifted 
out, the transmission completes and the DX pin is placed In a high Impedance 
state. An FSR input Is exactly complementary to the FSX. 

When using an external FSX, If DXR and XSR are empty, a write to DXR re¬ 
sults in a DXR to XSR transfer. This data Is held in the XSR until an FSX oc¬ 
curs. When the external FSX is received, the XSR begins shifting the data. If 
XSR Is waiting for the external FSX, a write to DXR will change DXR, but a 
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DXR to XSR transfer will not occur. XSR begins shifting when the external 
FSX is received, or when reset using XRESET. 

Continuous Transmit and Receive Modes 

When continuous mode is chosen, consecutive writes do not generate or ex¬ 
pect new sync pulse signalling. Only the first word of a block begins with an 
active synchronization. Thereafter, data continues to be transmitted as long 
as new data Is loaded into DXR before the last word has been transmitted. 
As soon as TXRDY is active and all of the data has been transmitted out of the 
shift register, the OX pin is placed in a high impedance state, and a subsequent 
write to DXR initiates a new block and a new FSX. 

Similarly with FSR, the receiver continues shifting in new data and loading 
DRR. If the data receive buffer Is not read before the next word Is shifted In, 
subsequent incoming data will be lost. The RFSM bit can be used to termi¬ 
nate the receive continuous mode. 
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Handshake Mode 

The handshake mode (HS = 1) allows for direct connection between proces¬ 
sors. In this mode, all data words are transmitted with a leading 1 (see Figure 
9-19). For example. If an 8-bit word Is to be transmitted, the first bit sent Is 
a 1, followed by the 8-bit data word. 

In this mode, once the serial port transmits a word, it will not transmit another 
word until it receives a separately transmitted zero bit. Therefore, the 1 bit that 
precedes every data word Is, In effect, a request bit. 


data word 
(8-bits) 


DX- 


<iXXXXXXXX> 


leading one 


Figure 9-19. Data Word Format in Handshake Mode 


After a serial port receives a word (with the leading 1), and it has been read 
from the DRR, it sends a single 0 to the transmitting serial port. Thus, the 
single 0 bit acts as an acknowledge bit (see Figure 9-20). This single ac¬ 
knowledge bit Is sent every time the DRR Is read, even if the DRR does not 
contain new data. 



DX 

single zero 

Figure 9-20. Single Zero Sent as an Acknowledge 


When the serial port is placed in the handshake mode, the insertion and de¬ 
letion of a leading 1 for transmitted data, the sending of a 0 for acknowl¬ 
edgement of received data, and the waiting for this acknowledge bit are all 
performed automatically. Using this scheme, it Is simple to connect processors 
with no external hardware and guarantee secure communication. A typical 
configuration is shown in Figure 9-21. 

In the handshake mode, FSX is automatically configured as an output. Con¬ 
tinuous mode Is automatically disabled. After a system reset or XRESET, the 
transmitter is always permitted to transmit. The transmitter and receiver must 
be reset when entering the handshake mode. 
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Figure 9-21. Direct Connection Using Handshake Mode 


9.2.11 Serial Port Interrupt Sources 

A serial port has four interrupt sources: 

1) The transmit timer interrupt: The rising edge of XTSTAT causes a single 
cycle interrupt pulse to occur. When XTINT is 0, this interrupt pulse is 
disabled. 

2) The receive timer interrupt: The rising edge of RTSTAT causes a single 
cycle Interrupt pulse to occur. When RTINT is 0, this Interrupt pulse Is 
disabled. 

3) The transmitter Interrupt: Occurs immediately following a DXR to XSR 
transfer. The transmitter interrupt is a single cycle pulse. When the 
global serial-port control register XINT Is 0, this interrupt pulse Is disa¬ 
bled. 

4) The receiver interrupt: Occurs Immediately following a RSR to DRR 
transfer. The receiver interrupt is a single cycle pulse. When the global 
serial-port control register RINT is 0, this Interrupt pulse is disabled. 

The transmit timer Interrupt pulse is ORed with the transmitter Interrupt pulse 
to create the CPU transmit Interrupt flag XINT. The receive timer Interrupt 
pulse is ORed with the receiver interrupt pulse to create the CPU receive in¬ 
terrupt flag RINT. 

9.2.12 Serial Port Functional Operation 

The following paragraphs and figures illustrate the functional timing of the 
various serial port modes of operation. The timing descriptions are presented 
assuming that all signal polarities are configured to be positive, i.e. 
CLKXP = CLKRP= DXP=DRP = FSXP = FSRP=0. Logical timing, in situations 
where one or more of these polarities are inverted, is the same but with respect 
to the opposite polarity reference points, I.e. rising vs. falling edges, etc. 

These discussions pertain to the numerous operating modes and configura¬ 
tions of the serial port logic. When It is necessary to switch operating modes 
or change configurations of the serial port, this should be done only when 
XRESET or or RRESET are asserted (low) as appropriate. Therefore, when 
transmit configurations are modified, XRESET should be low, and when re¬ 
ceive configurations are modified, RRESET should be low. When In handshake 
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mode, however, since the transmitter a nd recei ver a re interr elated, any con¬ 
figuration changes should be made with XRESET and RRESET both low. 

All of the various serial port operating configurations can be broadly classified 
in two categories: fixed data rate timing and variable data rate timing. The 
following paragraphs discuss fixed and variable data rate operation and all of 
their variations. 

Fixed Data Rate Timing Operation 

Fixed data rate serial port transfers can occur In two varieties: burst mode and 
continuous mode. In burst mode operation, transfers of single words are se¬ 
parated by periods of inactivity on the serial port. In continuous mode, there 
are no gaps between successive word tranfers, I.e., the first bit of a new word 
is transferred on the next CLKX/R pulse following the last bit of the previous 
word. This occurs continuously until the process is terminated. 

In burst mode with fixed data rate timing, FSX/FSR pulses Initiate transfers, 
and each transfer involves a single word. With an internally generated FSX 
(see Figure 9-22), transmission is Initiated by loading DXR. In this mode, 
there is an approximately 2.5 CLKX cycle delay (depending on CLKX and H1 
frequencies) from DXR being loaded until FSX occurs. With an external FSX, 
the FSX pulse initiates the transfer and the 2.5 cycle delay effectively becomes 
a setup requirement for loading DXR with respect to FSX. Therefore, in this 
case, DXR must be loaded no later than 3 CLKX cycles before FSX occurs. 
Once the XSR is loaded from the DXR, an XINT Is generated. 


CLKX/R 



FSR/FSX 
(EXTERNAL)"" 
FSX 

(INTERNAL)" 
DX/DR- 


t 

DXR 

LOADED 



- < A1 i Cl i t A. 1 - 

I I 

XINT RINT 
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Figure 9-22. Fixed Burst Mode 


In receive operations, once a transfer is initiated, FSR is Ignored until the last 
bit. For burst mode transfers, FSR must be low during the last bit, or another 
transfer will be Initiated. After a full word has been received and transfered to 
the DRR, an RINT is generated. 

In fixed data rate mode, continuous transfers may be performed even if 
R/XFSM=0 as long as properly timed frame synchronization Is provided, or if 
DXR Is reloaded each cycle (with an internally generated FSX), see Figure 
9-23. 
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Figure 9-23. Fixed Continuous Mode With Frame Synch 
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For receive operations and with externally generated FSX, once transfers have 
begun, frame sync pulses are only required during the last bit transferred to 
Initiate another contiguous transfer. Otherwise, frame sync Inputs are Ignored. 
Therefore, continuous transfers will occur if frame sync is held high. With an 
internally generated FSX, there is an approximately 2.5 CLKX cycle delay fol¬ 
lowing DXR being loaded before FSX occurs. This delay occurs each time 
DXR is loaded, therefore, during continuous transmission, the instruction 
which loads DXR must be executed by the N-3 bit, (for an N-bit trans¬ 
mission). Since delays due to pipelining may vary, a conservative margin of 
safety should be incorporated in accounting for this delay. 

Once the process begins, an XINT and an RINT are generated at the beginning 
of each transfer. The XINT indicates that the XSR has been loaded from DXR, 
and can be used to cause DXR to be reloaded. To maintain continuous trans¬ 
mission in this mode, especially with an interally generated FSX, DXR must 
be reloaded early in the ongoing transfer. 

The RINT indicates that a full word has been received and transferred into the 
DRR. RINT is therefore commonly used to Indicate an appropriate time to read 
DRR. 

Continuous transfers are terminated by discontinuing frame sync pulses or, in 
the case of internally generated FSX, not reloading DXR. 

Continuous serial port transfers can be accomplished without the use of frame 
sync pulses if R/XFSM are set to one. In this mode, operation of the serial 
port is similar to continuous operation with frame sync except that a frame 
sync pulse is involved only In the first word transferred, and no further frame 
sync pulses are used. Following the first word transferred (see Figure 9-24), 
no internal frame sync pulses are generated, and frame sync inputs are ig¬ 
nored. Additionally, R/XFSM should be set prior to or during the first word 
transferred, and must be set no later than the transfer of the N-1 bit of the first 
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word, except for transmit operations. For transmit operations in the fixed data 
rate mode, XFSM must be set no later than the N-2 bit. Clearing R/XFSM 
must be performed no later than the N-1 bit to be recognized in the current 
cycle. 


FSR/FSX 

(EXTERNAL) 

FSX 

(INTERNAL) 

DR/DX 



I--II- 

< Ai ) r{j X an / Bi X BN x ~~cr~x 


1 

DXR 

\ 

f 1 

XINT 

t 

SET 

_ 

( t 

XINT 

1 1 

XINT 

LOADED 


R/XFSM 

RINT < 

RINT < 


DXR 

LOAD DXR 

LOAD DXR 


LOADED 

READ DRR 

READ DRR 


Figure 9-24. Fixed Continuous Mode Without Frame Synch 


Timing of RINT and XINT and data transfers to and from DXR and DRR, re¬ 
spectively, are the same as in fixed data rate continuous mode with frame sync. 
This mode of operation also exhibits the same 2.5 CLKX cycle delay following 
DXR being loaded before an internal FSX is generated. As in the case of 
continuous operation in fixed data rate mode with frame sync, DXR must be 
reloaded no later than transmission of the N-3 bit. 

When using continuous operation in fixed data rate mode, R/XFSM may be 
set and cleared as desired, even during active transfers, to enable or disable the 
use of frame sync pulses as dictated by system requirements. Under most 
conditions, the effect of changing the state of R/XFSM occurs during the 
transfer In which the R/XFSM change was made, provided the change was 
made early enough in the transfer. For transmit operations with internal FSX 
in fixed data rate mode, however, a one word delay occurs before frame sync 
pulse generation resumes when clearing XFSM to zero (see Figure 9-25). 
Therefore, one additional word is transferred in this case before the next FSX 
pulse Is generated. Also note that, as discussed previously, clearing XFSM 
will be recognized during the current word being transmitted as long as XFSM 
Is cleared no later than the N-1 bit. Setting XFSM Is recognized as long as 
XFSM Is set no later than the N-2 bit. 
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I I I I I I 

I 1ST WORD I 2ND WORD j 3RD WORD | 4TH WORD j 5TH WORD | 



LOAD SET RESET 

DXR XFSM XFSM 


Figure 9-25. Exiting Fixed Continuous Mode Without Frame Synch, FSX Internal 


Variable Data Rate Timing Operation 

Variable data rate timing also supports operation in either burst or continuous 
mode. Burst mode operation with variable data rate timing is similar to burst 
mode operation with fixed data rate timing. With variable data rate timing (see 
Figure 9-26) however, FSX/R and data timing differs slightly at the beginning 
and end of transfers. Specifically, there are three major differences between 
fixed and variable data rate timing. 


FSR/FSX 

(EXTERNAL) 

FSX 

(INTERNAL) 



DX/DR - 


DXR 

LOADED 


~ r~Ao r^ j y an ) - 

I I 

XINT RINT 


Figure 9-26. Variable Burst Mode 


First, FSX/R pulses typically last for the entire transfer interval, although FSR 
and external FSX are ignored after the first bit transferred. FSX/R pulses In 
fixed data rate mode typically last only one CLKX/R cycle, but can last longer. 

Second, data transfer begins during the CLKX/R cycle in which FSX/R occurs, 
rather than the CLKX/R cycle following FSX/R, as is the case with fixed data 
rate timing. 
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Finally, with variable data rate timing, frame sync inputs are ignored until the 
end of the last bit transferred, rather than the beginning of the last bit trans¬ 
ferred as is the case with fixed data rate timing. 

When transmitting continuously in variable data rate mode with frame sync, 
timing is the same as for fixed data rate mode, besides the differences between 
these two modes as described under burst mode operation with variable data 
rate timing. The only exception to this Is that when operating continuously 
In variable data rate mode (see Figure 9-27), DXR must be reloaded no later 
than the N-4 bit to maintain continuous operation, as opposed to the N-3 bit 
for fixed data rate mode. 


FSR/FSX 
(EXTERNAL). 
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(INTERNAL) 
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DXR 

LOADED 
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DC 


f 
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XINT 
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1 1 
XINT 
RINT 




LOAD LOAD DXR LOAD DXR 

DXR READ DRR READ DRR 


Figure 9-27. Variable Continuous Mode With Frame Synch 


Continuous operation in variable data rate mode without frame sync is also 
similar to continuous operation without frame sync in fixed data rate mode. 
As with variable data rate mode continuous operation with frame sync (see 
Figure 9-28), DXR must be reloaded no later than the N-4 bit to maintain 
continuous operation. Additionally, when R/XFSM is set or cleared In the 
variable data rate mode, the modification must be made no later than the N-1 
bit, for the result to be affected in the current transfer. 
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9.3 DMA Controller 

The TMS320C30 provides an on-chip Direct Memory Access (DMA) con¬ 
troller. The purpose of the DMA controller is to reduce the need for the CPU 
to perform input/output functions. The DMA controller can perform 
input/output operations without interfering with the operation of the CPU. 
Therefore, it is possible to interface the TMS320C30 to slow external memo¬ 
ries and peripherals (A/D's, serial ports, etc.) without reducing the computa¬ 
tional throughput of the CPU. The result Is improved system performance and 
decreased system cost. 

A DMA transfer consists of two operations: a read from a memory location and 
a write to a memory location. The DMA controller can read from and write to 
any location In the TMS320C30 memory map. This Includes all memory- 
mapped peripherals. The operation of the DMA is controlled with the fol¬ 
lowing set of memory-mapped registers: 

• DMA global control register 

• DMA source address register 

• DMA destination address register 

• DMA transfer counter register 

These registers, their memory-mapped addresses, and functions are shown in 
Figure 9-29. Each of these DMA registers will be discussed in the succeeding 
subsections. 


Register 

Peripheral 

Address 

DMA GLOBAL CONTROL 

SOSOOOh 

RESERVED 

808001h 

RESERVED 

808002h 

RESERVED 

808003h 

DMA SOURCE ADDRESS 

808004h 

RESERVED 

808005h 

DMA DESTINATION ADDRESS 

808006h 

RESERVED 

808007h 

DMA TRANSFER COUNTER 

808008h 

RESERVED 

808009h 

RESERVED 

80800Ah 

RESERVED 

80800Bh 

RESERVED 

80800Ch 

RESERVED 

80800Dh 

RESERVED 

80800Eh 

RESERVED 

80800Fh 


Figure 9-29. Memory-Mapped Locations for a DMA Channel 
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9.3.1 DMA Global Control Register 

The global control register controls the state in which the DMA controller 
operates. This register also Indicates the status of the DMA, which changes 
every cycle. Source and destination addresses can be Incremented, decre¬ 
mented, or synchronized using specified global control register bits. At system 
reset, ail bits in the DMA control register are set to 0. Table 9-7 lists the reg¬ 
ister bits, names, and functions. Figure 9-30 shows the bit configuration of 
the global control register. 

Table 9-7. Global Control Register Bits 


BIT 

NAME 

FUNCTION 

0-1 

START 

These bits control the state in which the DMA starts and stops. 
The DMA may be stopped without any loss of data (see Table 9-8). 

2-3 

STAT 

These bits indicate the status of the DMA. These status bits change 
every cycle (see Table 9-9). 

4 

INCSRC 

If INCSRC = 1, the source address is incremented after every read. 

5 

DECSRC 

If DECSRC = 1, the source address is decremented after every read. 

If INCSRC = DECSRC, the source address is not modified after a 
read. 

6 

INCDST 

If INCDST = 1, the destination address is incremented after every 
write. 

7 

DECDST 

If DECDST = 1, the destination address is decremented after every 
read. If INCDST = DECDST, the destination address is not modi¬ 
fied after a write. 

8-9 

SYNCH 

The SYNCH bits determine the timing synchronization between the 
events initiating the source and the destination transfers. The in¬ 
terpretation of the SYNCH bits is shown in Table 9-10. 

10 

TC 

The TC bit affects the operation of the transfer counter. If TC = 0, 
transfers are not terminated when the transfer counter becomes 
zero. If TC = 1, transfers are terminated when the transfer counter 
becomes zero. 

11 

TCI NT 

If TCINT = 1,the DMA Interrupt Is set when the transfer counter 
makes a transition to zero. If TCINT = 0, the DMA interrupt is not 
set when the transfer counter makes a transition to zero. 

12 

Reserved 

Read as zero. 


31 

30 

29 

28 

27 

26 

25 

24 

23 

22 

21 

20 

19 

18 

17 

16 

XX 

s 

XX 

XX 

1 1 

H 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

XX 

H 

15 

14 

13 

12 

11 

10 

9 

8 

7 

6 

5 

4 

3 

2 

1 

0 

XX 

XX 

XX 

XX 

TCINT 


SYNCH 

DECDST 

INCDST 

DECSRC 

INCSRC 

STAT 

START 


R/W R/W R/W R/W R/W R/W R/W R/W R R R/W R/W 

NOTE: XX = Reserved bit, read as 0. 

R = read, W = write. 


Figure 9-30. DMA Global Control Register 
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Table 9-8. START Bits and Operation of the DMA 


START 

FUNCTION 

00 

DMA read or write cycles in progress will be completed, any data read will 
be Ignored. Any pending read or write will be cancelled. The DMA is reset 
so that when started, a new transaction is begun; I.e., a read is performed. 

0 1 

If a read or write has begun, the read or write is completed before stopping, 
I.e. in the middle or at the end of a DMA transfer. If a read or write has 
not begun, no read or write is started. 

1 0 

If a DMA transfer has begun, the entire transfer is completed (including 
both read and write operations) before stopping. If a transfer has not be¬ 
gun, none is started. 

1 1 

DMA starts from reset or restarts from the previous state. 


Table 9-9. STAT Bits and Status of the DMA 


STAT 

FUNCTION 

00 

DMA is being held between DMA transfers (between a read and write). 

This is the value at reset. 

0 1 

DMA is being held In the middle of a DMA transfer, i.e. between a read 
and a write. 

1 0 

Reserved. 

1 1 

DMA busy; i.e., DMA is performing a read or write. 


Table 9-10. SYNCH Bits and Synchronization of the DMA 


SYNCH 

FUNCTION 

00 

No synchronization. Enabled Interrupts are ignored. 

0 1 

Source synchronization. A read is performed when an enabled interrupt 
occurs. 

1 0 

Destination synchronization. A write is performed when an enabled inter¬ 
rupt occurs. 

1 1 

Source and destination synchronization. A read is performed when an en¬ 
abled interrupt occurs. A write is then performed when the next enabled 
interrupt occurs. 
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9.3.2 Destination and Source Address Registers 

The DMA destination and source address registers are 24-bit registers. These 
registers are used when performing the increment and decrement as specified 
by control bits DECSRC, INCSRC, DECDST, and INCDST of the DMA global 
control register. The contents of these registers specify the destination and 
source addresses. The registers are incremented or decremented at the end 
of the corresponding memory access, i.e., source register for a read, destina¬ 
tion register for a write. On system reset, 0 is written to these registers. 


9.3.3 Transfer Counter Register 

The transfer counter register is a 24-bit register, controlled by a 24-bit counter 
that counts down. The counter decrements upon the completion of a DMA 
memory write. In this way, it can be used to control the size of a block of data 
transferred. The transfer counter register is set to 0 at system reset. 


9.3.4 CPU/DMA Interrupt Enable Register 

The CPU/DMA interrupt enable register (IE) is a 32-bit register located in the 
CPU register file. The CPU interrupt enable bits are in locations 10-0. The 
DMA interrupt enable bits are in locations 26-16. A 1 in a CPU/DMA interrupt 
enable register bit enables the corresponding interrupt. A 0 disables the cor¬ 
responding interrupt. At reset, 0 is written to this register. 


9 


Table 9-11 list the bits, names, and functions of the CPU/DMA interrupt en¬ 
able register. Figure 9-31 shows the IE register. The priority and decoding 
scheme of CPU and DMA interrupts is identical. Note that when the DMA 
receives an interrupt, this interrupt is acted upon based upon the SYNCH field 
of the DMA control register. Note that an interrupt may affect the DMA, but 
not the CPU and vice versa. Refer to Section 7. 
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Table 9-11. CPU/DMA Interrupt Enable Register Bits 


BIT 

NAME 

FUNCTION 

0 

EINTO 

Enable external interrupt 0 (CPU) 

1 

EINT1 

Enable external interrupt 1 (CPU) 

2 

EINT2 

Enable external interrupt 2 (CPU) 

3 

EINT3 

Enable external interrupt 3 (CPU) 

4 

EXINTO 

Enable serial port 0 transmit interrupt (CPU) 

5 

ERINTO 

Enable serial port 0 receive interrupt (CPU) 

6 

EXINT1 

Enable serial port 1 transmit interrupt (CPU) 

7 

ERINT1 

Enable serial port 1 receive interrupt (CPU) 

8 

ETINTO 

Enable timer 0 interrupt (CPU) 

9 

ETINT1 

Enable timer 1 interrupt (CPU) 

10 

EDINT 

Enable DMA controller interrupt (CPU) 

11-15 

Reserved 

Read as 0 

16 

EINTO 

Enable external interrupt 0 (DMA) 

17 

EINT1 

Enable external interrupt 1 (DMA) 

18 

EINT2 

Enable external interrupt 2 (DMA) 

19 

EINT3 

Enable external Interrupt 3 (DMA) 

20 

EXINTO 

Enable serial port 0 transmit interrupt (DMA) 

21 

ERINTO 

Enable serial port 0 receive interrupt (DMA) 

22 

EXINT1 

Enable serial port 1 transmit interrupt (DMA) 

23 

ERINT1 

Enable serial port 1 receive interrupt (DMA) 

24 

ETINTO 

Enable timer 0 interrupt (DMA) 

25 

ETINT1 

Enable timer 1 Interrupt (DMA) 

26 

EDINT 

Enable DMA controller Interrupt (DMA) 

27-32 

Reserved 

Read as 0 
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ERINTO 

(DMA) 

EXINTO 
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(DMA) 
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(DMA) 
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(DMA) 
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R/W 
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R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 
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2 
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0 

XX 

XX 

XX 

XX 
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EDINTO 

(CPU) 

ETINT1 

(CPU) 

ETINTO 

(CPU) 

ERINT1 

(CPU) 

EXINT1 

(CPU) 

ERINTO 

(CPU) 

EXINTO 

(CPU) 

EINT3 

(CPU) 

EINT2 

(CPU) 

EINT1 

(CPU) 

EINTO 

(CPU) 






R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 

R/W 


NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 


Figure 9-31. CPU/DMA Interrupt Enable Register 
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9.3.5 DMA Memory Transfer Operation 

Each DMA memory transfer consists of two parts; 

1) Read data from the address specified by the DMA source register. 

2) Write data that has been read to the address specified by the DMA des¬ 
tination register. 

A transfer is complete only when the read and write are complete. A transfer 
may be stopped by setting the START bits to the desired value. When the 
DMA is restarted (START = 1 1), it completes any pending transfer. 

At the end of a DMA read, the source address is modified as specified by the 
SRCINC and SRCDEC bits of the DMA global control register. At the end of 
a DMA write, the destination address is modified as specified by the DSTINC 
and DSTDEC bits of the DMA global control register. At the end of every 
DMA write, the DMA transfer counter is decremented. 

DMA on-chip reads and writes (reads and writes from on-chip memory and 
peripherals) are single cycle. DMA off-chip reads are two cycles. The first 
cycle is an internal setup with the external read beginning on the following 
cycle. The external read cycle is identifical to a CPU read cycle. DMA off-chip 
writes are identical to CPU off-chip writes. 

Through the 24-bit source and destination registers, the DMA is capable of 
accessing any memory-mapped location in the TMS320C30 memory map. 
Figure 9-32 through Figure 9-34 show the number of cycles a DMA transfer 
requires, depending upon whether the source and destination are on-chip 
memory and peripherals, the external port, or the I/O port. T represents the 
number of transfers to be performed. represents the number of wait states 
for the source read. represents the number of wait states for the destination 
write. Each entry in the table represents the total cycles required to do the T 
transfers, assuming no pipeline conflicts. 

Accompanying each table is a figure illustrating the timing of the DMA trans¬ 
fer. |R| and |W( represent single-cycle reads and writes, respectively. |R.R1 
and |W.W| represent multicycle reads and writes. |Cr| and |Cw| show the 
number of wait cycles for a read and write. |“| represents the cycle used as 
an internal setup for DMA external reads. 
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Figure 9-32. Timing and Number of Cycles for DMA Transfers When Destination 

is On-Chip 


CYCLE 


EDED 

Source On-Chip 

Dest Primary bus 

R 

1 R 1 : : 

w. w. w. wj w 

: 1 Cw 1 

R 1 : : j R 1 : : 

W. W. wi W. W. W. W: 

1 1 : 1 C ^ 1 






Source Primary Bus 

Dest Primary Bus 


R . R . R 1 

1 Cf 1 : 

iw.w 

l-.R.R.RI 

: : : 1 C, 1 : 

wiwi iwiw 

1 Cw 1 : : : : : 

w 

1 C 

w 

w 

1 



Source Expansion Bus 

Dest Primary Bus 


R.R.Rl l-.R.R.RI l-.R.R.R 

1 C, 1 : : 1 Cf 1 : : | C, 

: 1 W. W. W. W| 1 W. W. W. Wl 

• • ■ • ' 1 : : 1 Cw 1 

: 

: 

W. W 

^ w 
i C 

W 

w 

1 


SOURCE 

DESTINATION PRIMARY BUS 

On-Chip 

1+(2 + Cw)r 

Primary Bus 

(2 + Cr+2 + Cw )T 

Expansion Bus 

(2+C|-+ 2C^) 


Figure 9-33. DMA Timing When Destination is a Primary Bus 
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Figure 9-34. DMA Timing When Destination is an Expansion Bus 
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Table 9-12 shows the maximum DMA transfer-rates assuming no wait states 
0). Table 9-13 shows the maximum DMA transfer-rates assuming 
one wait state for the read (0^=1) and no wait states for the write (Cw=0). 
Table 9-14 shows the maximum DMA transfer-rates assuming one wait state 
for the read (C^ =1) and one wait state for the write (C^ )• 

In each table, the complete transfer is considered (I.e., the time to do the read 
and the write). Since one bus access is required for the read and another for 
the write, bus transfer-rates will be twice the transfer-rate. It is also assumed 
that no conflicts with the CPU exist. 

Table 9-12. Maximum DMA Transfer Rates When Cr = Cw = 0 




DESTINATION 


SOURCE 

INTERNAL 

PRIMARY 

EXPANSION 

INTERNAL 

33.3 Mbytes/sec 

33.3 Mbytes/sec 

33.3 Mbytes/sec 

PRIMARY 

22.2 Mbytes/sec 

16.7 Mbytes/sec 

22.2 Mbytes/sec 

EXPANSION 

22.2 Mbytes/sec 

22.2 Mbytes/sec 

16.7 Mbytes/sec 


Table 9-13. Maximum DMA Transfer Rates When C/« - 1, - 0 




DESTINATION 


SOURCE 

INTERNAL 

PRIMARY 

EXPANSION 

INTERNAL 

33.3 Mbytes/sec 

33.3 Mbytes/sec 

33.3 Mbytes/sec 

PRIMARY 

16.7 Mbytes/sec 

13.3 Mbytes/sec 

16.7 Mbytes/sec 

EXPANSION 

16.7 Mbytes/sec 

16.7 Mbytes/sec 

13.3 Mbytes/sec 


Table 9-14. Maximum DMA Transfer Rates When C, = 1, Cm, = 1 




DESTINATION 


SOURCE 

INTERNAL 

PRIMARY 

EXPANSION 

INTERNAL 

33.3 Mbytes/sec 

22.2 Mbytes/sec 

22.2 Mbytes/sec 

PRIMARY 

16.7 Mbytes/sec 

11.1 Mbytes/sec 

16.7 Mbytes/sec 

EXPANSION 

16.7 Mbytes/sec 

16.7 Mbytes/sec 

11.1 Mbytes/sec 
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9.3.6 Synchronization of DMA Channels 

A DMA channel may be synchronized through the use of interrupts. Refer to 
Table 9-10 for the relationship between the SYNCH bits of the DMA global 
control register and the synchronization performed. This section describes the 
following four synchronization mechanisms: 

• No synchronization (SYNCH =0 0) 

• Source synchronization (SYNCH =0 1) 

• Destination synchronization (SYNCH = 1 0) 

• Source and destination synchronization (SYNCH = 11) 
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No Synchronization 

When SYNCH = 0 0, no synchronization is performed. The DMA will perform reads and 
writes whenever there are no conflicts. All Interrupts are ignored, and therefore can be 
considered to be globally disabled. However, no bits in the DMA Interrupt enable register 
are changed. Figure 9-35 shows the synchronization mechanism when SYNCH = 0 0. 



Figure 9-35. No DMA Synchronization 
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Source Synchronization 

When SYNCH = 0 1, the DMA is synchronized to the source (see Figure 9-36). A read 
will not be performed until an interrupt is received by the DMA. Then, ail DMA interrupts 
are disabled globally. However, no bits in the DMA interrupt enable register are changed. 



Figure 9-36. DMA Source Synchronization 
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Destination Synchronization 

When SYNCH = 10, the DMA is synchronized to the destination. First, all interrupts are 
ignored until the read is complete. Though the DMA interrupts may be considered to be 
globally disabled, no bits in the DMA interrupt enable register are changed. A write will 
not be performed until an interrupt is received by the DMA. Figure 9-37 shows the syn¬ 
chronization mechanism when SYNCH = 1 0. 



Figure 9-37. DMA Destination Synchronization 
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Source and Destination Synchronization 

When SYNCH = 1 1, all interrupts are ignored, and therefore can be considered to be 
globally disabled. However, no bits in the DMA interrupt enable register are changed. A 
read is performed when an interrupt is received. A write is performed on the following 
interrupt. Source and destination synchronization when SYNCH = 1 1 is shown in Figure 
9-38. 



Figure 9-38. DMA Source and Destination Synchronization 







































Section 10 


Pipeline Operation 


TMS320C30 operation is controlled by five major functional units: fetch, de¬ 
code, read, execute, and DMA. To provide for maximum processor through¬ 
put, these units can perform in parallel, with each unit operating on a different 
instruction. The overlapping of the fetch, decode, read, and execute oper¬ 
ations of different instructions is called pipelining. The pipelining of these 
operations results in the high performance of the TMS320C30. The ability of 
the DMA to move data within the processor memory space results in an even 
greater utilization of the CPU with fewer interruptions of the pipeline, thus 
yielding greater performance. 

Major topics discussed in this section are as follows: 

• Pipeline Structure (Section 10.1 on page 10-2) 

• Pipeline Conflicts (Section 10.2 on page 10-4) 

- Branch conflicts 

- Register conflicts 

- Memory conflicts 

• Resolving Memory Conflicts (Section 10.3 on page 10-14) 

• Clocking of Memory Accesses (Section 10.4 on page 10-16) 

“ Program fetches 

- Data loads and stores 

- DMA accesses 
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10.1 Pipeline Structure 

The five major units of the TMS320C30 pipeline structure and their function 
are as follows: 
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Fetch Unit (F) 
Decode Unit (D) 


Read Unit (R) 
Execute Unit (E) 


Fetches the instruction words from memory and 
updates the program counter (PC). 

Decodes the instruction word and performs ad¬ 
dress generation. Any modification of the auxiliary 
registers and the stack pointer is controlled by this 
unit. 

If required, reads the operands from memory. 

If required, reads the operands from the register 
file, performs the necessary operation, and if 
needed writes results to the register file. If re¬ 
quired, results of previous operations are written 
to memory. 


DMA Channel (DMA) Reads and writes memory. 


The basic instruction has four levels: fetch, decode, read, and execute. Figure 
10-1 illustrates these four levels of the pipeline structure. The levels are in¬ 
dexed according to instruction and execution cycle. Also indicated is a place 
in the pipeline where all four units operate in parallel; the perfect overlap oc¬ 
curs at cycle {m). Those levels about to be executed are at m+1, and those 
just executed are at m -^. The TMS320C30 pipeline control allows for an ex¬ 
tremely high-speed execution rate by allowing an effective rate of one exe¬ 
cution per cycle. It also manages pipeline conflicts in a way that makes them 
transparent to the user. The user does not need to take any special precautions 
to guarantee correct operation. 


CYCLE 

INSTRUCTION . . . I m-3 I m-2 I m-1 I m 1 m + 1 I m+2 I 


I F I D I R I E I 


J 


F I D I R I E I 


K 

J 


F I D I R I E I 

I F I D 1 R I E I 

T 

PERFECT 

OVERLAP 


Figure 10-1. TMS320C30 Pipeline Structure 
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Priorities have been assigned to each of the functional units. The priorities 
from highest to lowest are: 

• Execute (highest) 

• Read 

• Decode 

• Fetch 

• DMA (lowest). 

When processing of an instruction is ready to pass to the next higher pipeline 
level, but that level is not ready to accept a new input a pipeline conflict oc¬ 
curs. In this case, the lower priority unit waits until the higher priority unit 
completes its currently executing function. 

Despite the DMA controllers low priority, conflicts with the CPU can be min¬ 
imized or even eliminated by suitable data structuring since the DMA con¬ 
troller has its own data and address buses. 
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10.2 Pipeline Conflicts 

The pipeline conflicts of the TMS320C30 can be grouped into the following 
main categories: 

Branch Conflicts Involve most of those instructions or operations which 
read and/or modify the PC. 

Register Conflicts Involve delays that can occur when reading or writing 
registers used for address generation. 

Memory Conflicts Occur when the internal units of the TMS320C30 
compete for memory resources. 

Each of these three types is discussed in the following sections. Examples are 
included. Note in these examples, when data is refetched or an operation is 
repeated, the symbol representing the stage of the pipeline is appended with 
a number. For example, if a fetch is performed again, the initial fetch is labeled 
F1 and the refetch is labeled F2. When an access is detained multiple cycles 
due to a 'not ready,' the symbols RDY and RDY are used to Indicate not ready 
and ready, respectively. 


10.2.1 Branch Conflicts 
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The first class of pipeline conflicts is that which occurs with standard (non- 
delayed) branches, i.e., BR, Bcond, DBcond, CALL, IDLE, RPTB, RPTS, 
REWcond, RETSconcf, interrupts, and reset. Conflicts arise with these In¬ 
structions and operations since during their execution, the pipeline is used 
only for the completion of the operation; other information fetched into the 
pipeline is discarded or refetched, or the pipeline is inactive. This is referred 
to as flushing the pipeline. Flushing the pipeline is necessary in these cases 
to guarantee that portions of succeeding instructions do not Inadvertantly get 
partially executed. TRAPcoa?^/ and C^LLcond are classified somewhat differ¬ 
ently from the other types of branches and are considered later. 

Example 10-1 shows the code and pipeline operation for a standard branch. 
Note that one dummy fetch is performed (FI), and then after the branch ad¬ 
dress is available, a new fetch (F2) is performed. This dummy fetch will affect 
the cache. 
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Example 10-1. Standard Branch 

BR THREE 

MPYF 

ADDF 

SUBF 

AND 


THREE OR 
STI 


; Unconditional branch 
; Not executed 
; Not executed 
; Not executed 
; Not executed 


; Fetched after BR is fetched 


PIPELINE OPERATION 


BR THREE 

OR 

STI 


THREE-^PC 
D I R I E I 

F1 I (nop) I (nop) | 


F2 


D 

F 


RPTS and RPTB both flush the pipeline, thus allowing for the RS, RE, and 
RC registers to be loaded at the proper time relative to the flow of the pipeline. 
If these registers are loaded without the use of RPTS or RPTB, no flushing of 
the pipeline occurs. If none of the repeat modes are being used, RS, RE, and 
RC may be used as general-purpose 32-bit registers without any pipeline 
conflicts occurring. In cases such as the nesting of RPTB due to nested in¬ 
terrupts, it may be necessary to load and store these registers directly while 
using the repeat modes. Since up to four instructions can be fetched before 
entering the repeat mode, loads should be followed by a branch to flush the 
pipeline. If the RC is changing when an instruction is loading it, the direct load 
takes priority over the modification made by the repeat mode logic. 
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Delayed branches are implemented to guarantee the fetching of the next three 
instructions. The delayed branches include BRD, Bconc/D, and DBcondD. 
Example 10-2 shows the code and pipeline operation for a delayed branch. 


10-5 



Pipeline Operation - Pipeline Conflicts 


Example 10-2. Delayed Branch 


THREE 


BRD THREE 

; Unconditional 

MPYF 

; Executed 

ADDF 

; Executed 

SUBF 

; Executed 

AND 

; Not executed 


MPYF ; Fetched after 


delayed branch 


SUBF fetched 


BRD THREE 

MPYF 

ADDF 

SUBF 

MPYF 


PIPELINE OPERATION 

THREE->PC 

F I D 1 R I E I 

I F I D I R I 

I F I D I 

I F I 


E 

R 

D 

F 


E 1 

R |... 

D |... 


10.2.2 Register Conflicts 
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Register conflicts involve the reading or writing of registers used for address¬ 
ing purposes. These conflicts occur when the pertinent register is not ready 
to be used. The registers comprise the following three functional groups: 

Group 1 Auxiliary registers (AR0-AR7), index registers (IRO, IR1), and 
block size register (BK) 

Group 2 Data page pointer (DP) 

Group 3 System stack pointer (SP) 


If an instruction writes to one of these three groups, the use of any register 
within that particular group by the decode unit is delayed until the write is 
complete, i.e. instruction execution is completed. In Example 10-3, an auxil¬ 
iary register is loaded, and a different auxiliary register is used on the next in¬ 
struction. Since the decode stage needs the result of the write to the auxiliary 
register, the decode of this second Instruction is delayed two cycles. Every 
time the decode is delayed, a refetch of the program word is performed; i.e., 
the first fetch of ADDF Is at FI, followed by F2 and F3 (the final fetch). Since 
these are actual refetches, they can cause conflicts with the DMA controller 
and cache hits and misses. 
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Example 10-3. Write to an AR Followed by an AR for Address Generation 

LDI 7,AR1 ; 7 ARl 

NEXT MPYF *AR2,R0 ; Decode delayed 2 cycles 

ADDF 
FLOAT 


PIPELINE OPERATION 


LDI 7,ARl 1 

F 

1 D 

1 R 

7-+AR1 

1 E 1 


MPYF *AR2,R0 


1 F 

1 D1 

1 D2 I 

D3 

ADDF 

FLOAT 



1 FI 

1 F2 1 

F3 


The case for reads of these groups is similar to the case for writes. If an In¬ 
struction must read a member of one of these groups, the use of that particular 
group by the decode for the following instruction is delayed until the read is 
complete. The registers are read at the start of the execute cycle and therefore 
require only a one cycle delay of the following decode. For four registers (IRQ, 
IR1, BK, or DP) no delay is incurred. In all other cases, including the SP, the 
delay occurs. In Example 10-4, two auxiliary registers are added together with 
the result going to an extended-precision register. The next instruction uses 
a different auxiliary register as an address register. 


Example 10-4. A Read of ARs Followed by ARs for Address Generation 


ADDI AR0,AR1,R1 ; ARO + ARl Rl 

NEXT MPYF *++AR2,R0 ; Decode delayed 1 cycle 

ADDF 
FLOAT 
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PIPELINE OPERATION 


ADDI 1 F 

1 D 1 

R 

AR0+AR1-*R1 

1 E 1 

MPYF* ++AR2,R0 

1 F 1 

D1 

1 D2 1 

ADDF 

1 

FI 

1 F2 I 

FLOAT 



1 


Note that while the DBR (decrement and branch) instruction does not use the 
auxiliary registers for addressing, its use of them as loop counters is treated 
as if it did. Therefore, the operation shown in the two previous examples can 
also occur for this instruction. 
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10.2.3 Memory Conflicts 

Possible memory conflicts occur when the memory bandwidth of a physical 
memory space is exceeded. For example, RAM blocks 0 and 1 and the ROM 
block can support only two accesses every cycle. The external interface can 
support only one access per cycle. Some conditions under which memory 
conflicts can be easily avoided are discussed in Section 10.3. 

Memory pipeline conflicts consist of the following four types: 

Program Wait A program fetch is prevented from begin¬ 

ning. 

Program Fetch Incomplete A program fetch has begun, but is not yet 

complete. 

Execute Only An instruction sequence requires three 

CPU-data accesses in a single cycle. 

Hold Everything A primary or expansion bus operation must 

complete before another one can proceed. 

These four types of memory conflicts are discussed in the succeeding para¬ 
graphs and examples provided. 

Program Wait 

Two conditions can prevent the program fetch from beginning: 

• The start of a CPU-data access when: 

- Two CPU-data accesses are made to an internal RAM or ROM 
block, and a program fetch from the same block Is necessary. 

- One of the external ports is starting a CPU-data access, and a 
program fetch from the same port is necessary. 

• A multicycle CPU-data access or DMA-data access over the external bus 
is needed. 
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An example of program wait until a CPU-data access completes is illustrated 
in Example 10-5. In this case, *AR0 and *AR1 are both pointing to data in 
RAM block 0, and the MPYF instruction will be fetched from RAM block 0. 
This results In the conflict shown in Example 10-5. Since no more than two 
accesses can be made to RAM block 0 In a single cycle, the program fetch 
cannot begin, and must wait until the CPU-data accesses are complete. 

Example 10-5. Program Wait Until CPU-Data Access Completes 

ADDF3 *AR0,*AR1,R0 

FIX 

MPYF 

ADDF3 

NEGB 


PIPELINE OPERATION 

*AR0 MEMORY READ 
•AR1 MEIVIORY READ 


ADDF3 *AR0,*AR1,R0 

I F 

I D 

I R I 

E 

I 



FIX 


I F 

I D I 

R 

I E 

I 


MPYF 



(wait) 

I F I 

F 

I D 

I R 

I E 

ADDF 


• 



I F 

I D 

I R 

NEGB 






I F 

I D 


Example 10-6 shows an example of a program wait due to a multicycle data- 
data access or a multicycle DMA access. The ADDF, MPYF, and SUBF are 
fetched from some portion In memory other than the external port the DMA 
requires. The DMA begins a multicycle access. The program fetch corre¬ 
sponding to the CALL is made to the same external port the DMA is using. 
Even though the DMA has the lowest priority, multi-cycle access cannot be 
aborted. The program fetch must therefore wait until the DMA access com¬ 
pletes. 



Example 10-6. Program Wait Due to Multicycle Access 


ADDF I 

F I D 

I R 

I E I 



MPYF 

I F 

I D 

I R I 

E 

I 

SUBF 


i F 

I D I 

R 

I E 

CALL 


(wait) 

I F I 

I 2 cycle DMA access | 

F 

I D 
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Program Fetch Incomplete 


A program fetch incomplete occurs when a program fetch takes more than one 
cycle to complete due to wait states. In Example 10-7, the MPYF and ADDF 
are fetched from memory that supports single-cycle accesses. The SUBF is 
fetched from memory requiring one wait state. 


Example 10-7. Multicycle Program Memory Fetches 

MPYF I F I D I R I E I 

ADDF I F 1 D I R I 

RDY RDY 

SUBF I F I F I 

ADDI I 


E 

D 

F 


I R 
I D 


E 

R 


Execute Only 

The Execute Only type of memory pipeline conflicts occurs when a sequence 
of instructions requires three CPU-data accesses in a single cycle or when 
performing an interlocked load. The three cases where this occurs are: 

• An instruction that performs a store, followed by an instruction that does 
two memory reads. 

• An instruction that performs two stores, followed by an instruction that 
performs at least one memory read. 

• An interlocked load (LDII or LDFI) instruction is performed, and XF1 = 
1 . 
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The first case is shown in Example 10-8. Since this sequence requires three 
data memory accesses and only two are available, only the execute phase of 
the pipeline is allowed to proceed. The dual reads required by the LDF || LDF 
will be delayed one cycle. Note that a refetch of the next instruction can oc¬ 
cur. 

Example 10-8. Single Store Followed by Two Reads 

STF R0,*AR1 ; RO *AR1 


11 

LDF 

LDF 

*AR2,R1 

*AR3,R2 

f 

7 

*AR2 

*AR3 

-► R1 
R2 

in 

parallel 

with 



PIPELINE OPERATION 





STF 

R0,*AR1 

1 F 1 

D 

1 

R0-»'AR1 

R 1 E 1 


LDF| 

|LDF 

1 

F 

1 

D 1 

R1 

•AR2-»R1 

•AR3-+R2 

1 R2 1 

E 





1 

F 1 

D1 

1 D2 1 

R 






1 

FI 

1 F2 I 

D 


Example 10-9 shows a parallel store followed by a single load or read. Since 
the two parallel stores are required, the next CPU-data memory read must wait 
a cycle before beginning. One program memory refetch may occur. 

Example 10-9. Parallel Store Followed by Single Read 


STF 

RO,*AR0 

; RO 

-► 

*AR0 in parallel with 

STF 

R2,*AR1 

; R2 

-> 

*AR1 

ADDF 

lACK 

ASH 

@SUM,R1 

; R1 

+ 

@SUM -* R1 
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PIPELINE OPERATION 


STF R0,*AR1 
STF R2,*AR1 1 

F 

1 D 

1 R 

R0-^*AR0 

R2-^*AR1 

1 E 1 


ADDF @SUM,R1 


1 F 

1 D 

(wait) 

1 R 1 

@SUM 

R 

lACK 



1 F 

1 D1 1 

D2 

ASH 




1 FI 1 

F2 
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The final case involves an interlocked load (LDII or LDFI) instruction and XF1 
= 1. Since the interlocked loads use the XF1 pin as an acknowledge that the 
read can complete, they may need to extend the read cycle, as shown In Ex¬ 
ample 10-10. Note that a program refetch may occur. 

Example 10-10. Interlocked Load 


NOT 

1 F 

1 D 

1 R 

1 E 

1 

LDII 


1 F 

1 D 

XF1=1 

1 R 

XF1=0 

1 R 

ADD I 



1 F 

1 D1 

1 D2 

CMP I 




1 FI 

1 F2 


Hold Everything 

The three types of Hold Everything memory pipeline conflicts are: 

• A CPU-data load or store cannot be performed because an external port 
is busy. 

• An external load that takes more than one cycle. 

• Conditional calls and traps. 

The first type of Hold Everything conflict occurs when one of the external 
ports is busy due to an access that has started but is not complete. In Example 
10-11, the first store is a two-cycle store. The CPU writes the data to an ex¬ 
ternal port. The port control then takes two cycles to complete the data-data 
write. The LDF is a read over the same external port. Since the store is not 
complete, the LDF will continue to be attempted until the port is available. 
For this case, the first dummy fetch occurs at the same time as D2. 

Example 10-11. Busy External Port 

STF R0,@DMA1 

LDF @DMA2,R0 


PIPELINE OPERATION 


STF 

1 F 1 D 

1 2-cycle DMA access 

1 R 1 E 

1 

1 



LDF 

1 F 

1 D 

1 nop 

1 

R 1 

1 E 



1 F 

1 D1 

I 

D2 i 

1 R 




1 FI 

1 

F2 j 

1 D 


I F 1... 
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The second type of Hold Everything conflict involves multicycle data reads. 
The read has begun and continues until completed. In Example 10-12, the 
LDF is performed from an external memory that requires several cycles to 
complete. 

Example 10-12. Multicycle Data Reads 

I F I D I R I E I 

I 2-cycle read | 

LDF @DMA,R0 | F | D | R | R | E | 

IF I D1 I D2 I R |... 

IF I nop I D |... 

I FI I F2 |... 

The final type of Hold Everything conflict deals with conditional calls and 
traps, which are different from the other branch instructions. Whereas the 
other branch instructions are conditional loads, the conditional calls and traps 
are conditional stores, which take one cycle more than a standard branch (see 
Example 10-13). 

Example 10-13. Conditional Calls and Traps 

(store) 

CALLcond | F | D | R | E | E | 

I FI I (nop) I (nop) | F2 | F3 1 D | 

I F I 
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10.3 Resolving Memory Conflicts 

If program fetches and data accesses are performed in such a manner that the 
resources being used cannot provide the necessary bandwidth, the program 
fetch is delayed until the data access Is complete. Certain configurations of 
program fetch and data accesses yield conditions under which the 
TMS320C30 can achieve maximum throughput. Table 10-1 shows how many 
accesses can be performed from the different memory spaces when It Is nec¬ 
essary to do a program fetch and a single data access, and still achieve maxi¬ 
mum performance (one cycle). There are four cases that achieve one cycle 
maximization (see Table 10-1). Table 10-2 shows how many accesses can 
be performed from the different memory spaces when it is necessary to do a 
program fetch and two data accesses, still achieving maximum performance 
(one cycle). There are six cases that achieve this maximization (see Table 
10 - 2 ). 

Table 10-1. One Program Fetch and One-Data Access for Maximum 

Performance 


CASE# 

PRIMARY BUS 
ACCESSES 

ACCESSES FROM 
DUAL-ACCESS 
INTERNAL MEMORY 

EXPANSION BUS 
OR PERIPHERAL 
ACCESSES 





1 

1 

1 

- 

2 

1 

- 

1 

3 


2 from any 
combination 
of internal memory 


4 

- 

1 

1 
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Table 10-2. One Program Fetch and Two Data Accesses for 
Maximum Performance 


CASE # 

PRIMARY BUS 
ACCESSES 

ACCESSES FROM 
DUAL-ACCESS 
INTERNAL MEMORY 

EXPANSION OR 
PERIPHERAL BUS 
ACCESSES 

■■■■■■I 




1 

1 

2 from any 
combination 
of internal memory 

— 

2 

1 Program 

1 Data 

1 Data 

3 

1 Data 

1 Data 

1 Program 

4 


2 from same internal 
memory block and 

1 from a different 
internal memory 
block 


5 


3 from different 
internal memory 
blocks 


6 


2 from any 
combination 
of internal memory 

1 
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10.4 Clocking Of Memory Accesses 

Internal clock phases (H1 and H3) and their relationship to memory accesses 
are discussed in this section to show how the TMS320C30 handles multiple 
memory accesses. Whereas the previous section discussed the interaction 
between sequences of instructions, this section discusses the flow of data on 
an individual instruction basis. 

Each major clock period of 60 ns is composed of two minor clock periods of 
30 ns, labeled as H3 and H1. 

Major Clock Period "►! 


H1 


H3 


The precise operation of memory reads and writes can be defined, based upon 
these minor clock periods. The types of memory operations which can occur 
are program fetches, data loads and stores, and DMA accesses. 


10.4.1 Program Fetches 

Internal program fetches are always performed during H3 unless a single data 
store must occur at the same time, due to another instruction in the pipeline. 
In this case, the program fetch occurs during HI and the data store during 
H3. 
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External program fetches always start at the beginning of H3 with the address 
being presented on the external bus. At the end of HI, they are completed 
with the latching of the instruction word. 


10.4.2 Data Loads and Stores 

Four types of instructions perform loads, memory reads, and stores: two-ope¬ 
rand instructions, three-operand instructions, multiplier/ALU operation with 
store instructions, and parallel multiply and add instructions. See Section 6 for 
detailed information on addressing modes. 

Two-Operand Instruction Memory Accesses 

Two-operand instructions include all those Instructions with bits 31 -29 being 
000 or 010 (see Figure 10-2). In the case of a data read, bits 15-0 represent 
the src operand. Internal data reads are always performed during HI. External 
data reads always start at the beginning of H3 with the address being pre¬ 
sented on the external bus, and complete with the latching of the Instruction 
word at the end of HI. 
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In the case of a data store, bits 15-0 represent the dst operand. Internal data 
stores are performed during H3. External data stores always start at the be¬ 
ginning of H3 with the address and data being presented on the external bus. 


31 

24 23 


1615 

87 

0 

gggi 


1 

G 

1 1 —1— I — 

dst(src) 

— 1 — 1 — 1 —1—1— 1 — I I 1 —1 1—1—1—1—1 

src(dst) 


Figure 10-2. Two-Operand Instruction Word 


Three-Operand Instruction Memory Reads 

Three-operand instructions include all instructions with bits 31 -29 being 001 
(see Figure 10-3). The source operands, srcl and src2, come from either reg¬ 
isters or memory. When one or more of the source operands are from memory, 
these instructions are always memory reads. 

If only one of the source operands is from memory (either srcl or src2) and 
is located in internal memory, the data is read during HI. If the single memory 
source operand is in external memory, the read starts at the beginning of H3, 
with the address being presented on the external bus, and completes with the 
latching of the data word at the end of HI. 

If both source operands are to be fetched from memory, then several cases 
occur. If both operands are located in internal memory, the src1 Is performed 
during H3 and src2 during HI, thus completing two memory reads in a single 
cycle. 


If srcl Is in Internal memory and src2 in external memory, the src2 access Is 
begun at the start of H3 and latched at the end of HI. At the same time, the 
srcl access to internal memory Is performed during H3. Again, two memory 
reads are completed in a single cycle. 


If srcl Is in external memory and src2 in internal memory, two cycles are nec¬ 
essary to complete the two reads. In the first cycle, the internal src2 access 
is performed. The srcl is also performed, but not latched until the next H3. 

If srcl and src2 are both from external memory, two cycles are required to 
complete the two reads. In the first cycle, the srcl access is performed and 
loaded on the next H3; in the second cycle, the src2 access is performed and 
loaded on that cycle's HI. 
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31 24 23 1615 8 7 0 


ggg 




mom 



Figure 10-3. Three-Operand Instruction Word 
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Operations with Parallel Stores 

The next class of instructions includes all instructions that have a store, in 
parallel with another instruction. Bits 31 and 30 for these instruction are equal 
to 1 1. 

For those operations that perform a multiply or ALU operation in parallel with 
a store, the instruction word format is shown in Figure 10-4. If the store op¬ 
eration to dst2 is external or internal, it is performed during H3. 

If the memory read operation is external, it is started at the beginning of H3 
and latched at the end of HI. If the memory read operation is internal, it is 
performed during HI. Note that memory reads are performed by the CPU 
during the read (R) phase of the pipeline, and stores during the execute (E) 
phase. 


31 24 23 1615 8 7 0 


1 ' 1 


IW 

—1—I— 

srcl 

—1—1— 
src3 

—1—1—1—1—1—1—1— 
dst2 

—1—\—1—1—1— \ —1— 
src2 


Figure 10-4. A Multiply or CPU Operation with a Parallel Store 
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The instruction word format for those instructions that have parallel stores to 
memory is shown in Figure 10-5. If both destination operands, dsti and 
dst2, are located in internal memory, dsti is stored during H3 and dst2 during 
HI, thus completing two memory stores in a single cycle. 

If dst1 is in external memory and dst2 in internal memory, the dsti store is 
begun at the start of H3. The dst2 store to internal memory is performed 
during HI. Again, two memory stores are completed in a single cycle. 

If dsti is in internal memory and dst2 in external memory, two cycles are ne¬ 
cessary to complete the dst2 store. In the first cycle, the internal dsti store is 
performed during H3. During the next cycle, the dst2 store Is performed be¬ 
ginning in H3. 

If dsti and dst2 are both from external memory, two cycles are necessary to 
complete the dst2 store. In the first cycle, the dsti access is performed; in the 
second cycle, the dst2 access is performed. 


31 24 23 1615 8 7 0 


DO 


mm 

wnw 

—1—1— 
srcl 

—I—I—1—1—1— \ —I— 

dsti 

—1—1—1—1—1— ] —1— 
dst2 


Figure 10-5. Two Parallel Stores 
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Parallel Multiplies and Adds 

The considerations of memory addressing for parallel multiplies and adds is 
similar to that for three-operand instructions. The parallel multiplies and adds 
include all instructions with bits 31 -30 equal to 10 (see Figure 10-6). 

For these operations, src3 and src4 are both located in memory. If both op¬ 
erands are located in internal memory, src3 is performed during H3 and src4 
during HI, thus completing two memory reads in a single cycle. 

If src3 is in internal memory and src4 in external memory, the src4 access is 
begun at the start of H3 and latched at the end of HI. At the same time, the 
src3 access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If src3 is in external memory and src4 in internal memory, two cycles are nec¬ 
essary to complete the two reads. In the first cycle, the internal src4 access 
is performed. During the H3 of the next cycle, the src3 access is performed. 

If src3 and src4 are both from external memory, two cycles are necessary to 
complete the two reads. In the first cycle, the src3 access is performed; in the 
second cycle, the src4 access is performed. 
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Section 11 


Assembly Language Instructions 


The TMS320C30 assembly language instruction set supports numeric¬ 
intensive signal processing and general-purpose applications, the instructions 
are organized into major groups consisting of load and store, two- or three- 
operand arithmetic/logical, parallel, program control, and Interlocked oper¬ 
ations instructions. The addressing modes used with the instructions are 
described In Section 6. 

An additional feature of the TMS320C30 instruction set is the capability of 
using one of 19 condition codes with any of the 10 conditional instructions, 
such as LDVcond. This section defines the condition codes and flags. 

The assembler allows optional syntax forms to simplify the assembly language 
for special-case instructions. These optional forms are listed and explained. 

Each of the Individual Instructions Is described and listed In alphabetical order. 
An Illustration showing an example instruction (see pages 11-15 through 
11-17) is provided to show the special format used and explain its content. 

Major topics discussed In this section are as follows: 

• Instruction Set (Section 11.1 on page 11 -2) 

- Load and store Instructions 

“ Two-operand arithmetic/logical Instructions 

- Three-operand arithmetic/logical Instructions 

- Program control Instructions 

- Interlocked operations instructions 
~ Parallel operations instructions 

• Condition Codes and Flags (Section 11.2 on page 11 -8) 

• Individual Instructions (Section 11.3 on page 11-11) 

- Symbols and abbreviations used In instructions 
Optional assembler syntaxes 

- Individual instruction descriptions alphabetized and including: 

Syntax 

Operation 

Operands 

Encoding 

Description 

Cycles 

Status bits 

Mode bit 

Example(s) 


11-1 
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11.1 Instruction Set 

The TMS320C30 instruction set is exceptionally well suited to digital signal 
processing and other numeric-intensive applications. All instructions are a 
single machine word long, and most instructions take a single cycle to exe¬ 
cute. In addition to multiply and accumulate instructions, the TMS320C30 
possesses a full complement of general-purpose Instructions. 

The instruction set contains 114 instructions organized Into the following 
functional groups: 

• Load and store 

• Two-operand arithmetic/logical 

• Three-operand arithmetic/logical 

• Program control 

• Interlocked operations 

• Parallel operations. 

Each of these groups is discussed In the succeeding subsections. 

11.1.1 Load and Store Instructions 

The TMS320C30 supports 12 load and store instructions (see Table 11-1). 
These instructions can: 

• Load a word from memory into a register, 

• Store a word from a register into memory, or 

• Manipulate data on the system stack. 

Two of these instructions can load data conditionally. This Is useful for locat¬ 
ing the maximum or minimum value in a data set. See Section 11.2 for detailed 
Information on condition codes. 

Table 11-1. Load and Store Instructions 


INSTRUCTION 

DESCRIPTION 

INSTRUCTION 

DESCRIPTION 

LDE 

Load floating-point exponent 

POP 

Pop integer from stack 

LDF 

Load floating-point value 

POPF 

Pop floating-point value from 
stack 

LD^cond 

Load floating-point value 
conditionally 

PUSH 

Push integer on stack 

LDI 

Load integer 

PUSHF 

Push floating-point value on 
stack 

LD\cond 

Load integer conditionally 

STF 

Store floating-point value 

LDM 

Load floating-point mantissa 

STI 

Store integer 
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11.1.2 Two-Operand Instructions 

The TMS320C30 supports a complete set of two-operand arithmetic and 
logical instructions. The two operands are the source and destination. The 
source operand may be a memory word, a register, or a part of the instruction 
word. The destination operand is always a register. 

These instructions provide integer, floating-point, or logical operations, and 
multiprecision arithmetic. Table 11-2 lists these instructions. 


Table 11-2. Two-Operand Instructions 


INSTRUCTION 

DESCRIPTION 

INSTRUCTION 

DESCRIPTION 

ABSF 

Absolute value of a floating¬ 
point number 

NORM 

Normalize floating-point value 

ABSI 

Absolute value of an integer 

NOT 

Bitwise logical-complement 

ADDC 

t 

Add integers with carry 

OR 


Bitwise logIcal-OR 

ADDF 

t 

Add floating-point values 

RND 

Round floating-point value 

ADDI 

t 

Add Integers 

ROL 

Rotate left 

AND 

t 

Bitwise logical-AND 

ROLC 

Rotate left through carry 

ANDN 

1 

Bitwise logical-AND with 
complement 

ROR 

Rotate right 

ASH 

B 

Arithmetic shift 

RORC 

Rotate right through carry 

CMPF 

B 

Compare floating-point values 

SUBB 

B 

Subtract integers with borrow 

CMPI 

B 

Compare integers 

SUBC 

Subtract integers conditionally 

FIX 

Convert floating-point value to 
integer 

SUBF 

Subtract floating-point values 

FLOAT 

Convert integer to floating-point 
value 

SUBI 

Subtract integer 

LSH 

1 

Logical shift 

SUBRB 

Subtract reverse integer with 
borrow 

MPYF 

1 

Multiply floating-point values 

SUBRF 

Subtract reverse floating-point 
value 

MPYI 

B 

Multiply integers 

SUBRI 

Subtract reverse integer 

NEGB 

Negate integer with borrow 

TSTB 

B 

Test bit fields 

NEGF 

Negate floating-point value 

XOR 

B 

Bitwise exclusive-OR 

NEGI 

Negate integer 



tTwo- and three-operand versions 
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11.1.3 Three-Operand Instructions 

Most instructions have only two operands; however, some arithmetic and 
logical instructions have three-operand versions. Three-operand instructions 
allow the TMS320C30 to read two operands from memory or the CPU register 
file in a single cycle and store the results in a register. The following differ¬ 
entiates the two- and three-operand Instructions; 

• Two-operand instructions have a single source operand (or shift count) 
and a destination operand. 

• Three-operand instructions may have two source operands (or one 
source operand and a count operand) and a destination operand. A 
source operand may be a memory word or a register. The destination 
of a three-operand instruction is always a register. 

Table 11-3 lists the instructions that have three-operand versions. Note that 
the '3' in the mnemonic can be omitted from three-operand instructions (see 
Section 11.3.2). 

Table 11-3. Three-Operand Instructions 


INSTRUCTION 

DESCRIPTION 

INSTRUCTION 

DESCRIPTION 

ADDC3 

Add with carry 

MPYF3 

Multiply floating-point values 

ADDF3 

Add floating-point values 

MPYI3 

Multiply integers 

ADDI3 

Add integers 

OR3 

Bitwise logical-OR 

AND3 

Bitwise logicai-AND 

SUBB3 

Subtract integers with borrow 

ANDN3 

Bitwise logical-AND with 
complement 

SUBF3 

Subtract floating-point values 

ASH3 

Arithmetic shift 

SUBI3 

Subtract integers 

CMPF3 

Compare floating-point values 

TSTB3 

Test bit fields 

CMPI3 

Compare integers 

XOR3 

Bitwise exclusive-OR 

LSH3 

Logical shift 
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11.1.4 Program Control Instructions 

The program-control instruction group consists of all of those instructions 
which affect program flow. The repeat mode allows repetition of a block of 
code (RPTB) or of a single line of code (RPTS). Both standard and delayed 
(single-cycle) branching are supported. Several of the program control in¬ 
structions are capable of conditional operations (see Section 11.2 for detailed 
information on condition codes). Table 11-4 lists the program control in¬ 
structions. 
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Table 11-4. Program Control Instructions 


INSTRUCTION 

DESCRIPTION 

INSTRUCTION 

DESCRIPTION 

Bcond 

Branch conditionally (standard) 

IDLE 

Idle until interrupt 

^condD 

Branch conditionally (delayed) 

NOP 

No operation 

BR 

Branch unconditionally 
(standard) 

RETI cond 

Return from interrupt 
conditionally 

BRD 

Branch unconditionally 
(delayed) 

RETS cond 

Return from subroutine 
conditionally 

CALL 

Call subroutine 

RPTB 

Repeat block of instructions 

CALLcond 

Call subroutine conditionally 

RPTS 

Repeat single instruction 

DBcond 

Decrement and branch 
conditionally (standard) 

SWI 

Software interrupt 

DBcondD 

Decrement and branch 
conditionally (delayed) 

TRAP cond 

Trap conditionally 


11.1.5 Interlocked Operations Instructions 

The interlocked operations instructions support multiprocessor communi¬ 
cation. Through the use of external signals, these instructions allow for pow¬ 
erful synchronization mechanisms. They also guarantee the Integrity of the 
communication and result in a high-speed operation. Refer to Section 7 for 
examples of the use of interlocked instructions. Table 11-5 lists the five in¬ 
terlocked operations instructions. 

Table 11-5. Interlocked Operations instructions 


INSTRUCTION 

DESCRIPTION 

INSTRUCTION 

DESCRIPTION 

LDFI 

Load floating-point value. 
Interlocked 

STFI 

Store floating-point value, 
interlocked 

LDII 

Load Integer, interlocked 

STII 

Store integer, interlocked 

SIGI 

Signal, interlocked 



11 . 1.6 


Parallel Operations Instructions 

The parallel-operations instructions group allows for a high degree of paral¬ 
lelism. Some of the TMS320C30 instructions can occur In pairs that will be 
executed In parallel. These parallel instructions provide: 
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• Parallel loading of registers, 

• Parallel arithmetic operations, or 

• Arithmetic/logical instructions used In parallel with a store Instruction. 

Each Instruction in a pair is entered as a separate source statement. The sec¬ 
ond Instruction in the pair must be preceded by two vertical bars (||). Table 
11-6 lists the valid instruction pairs. 
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Table 11-6. Parallellnstructions 


MNEMONIC 

DESCRIPTION 

OPERATION 

PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS 

ABSF 

11 STF 

Absolute value of a floating-point 

|src2| -► dsti 

11 src3 dst2 

ABSI 

11 STI 

Absolute value of an integer 

tsrc2i dsti 

11 src3 dst2 

ADDF3 

II STF 

Add floating-point 

srcl + src2 dsti 

1! src3 dst2 

ADDI3 

11 STI 

Add integer 

srcl + src2 dsti 

11 src3 ->■ dst2 

AND3 

II STI 

Bitwise logical-AND 

srcl AND src2 -* dsti 

II src3 ”*■ dst2 

ASH3 

11 STI 

Arithmetic shift 

If count ^ 0: 

src2 << count -► dsti 

11 src3 dst2 

Else: 

src2 >> Icountl -*■ dsti 

11 src3 -* dst2 

FIX 

II STI 

Convert floating-point to integer 

Fix(src2) -► dsti 

11 src3 dst2 

FLOAT 

II STF 

Convert integer to floating-point 

Float(src2) dsti 

11 src3 dst2 

LDF 

II STF 

Load floating-point 

src2 -► dsti 

II src3 dst2 

LDI 

11 STI 

Load integer 

src2 dsti 

11 src3 dst2 

LSH3 

11 STI 

Logical shift 

If count ^ 0: 

src2 << count dsti 

11 src3 -► dst2 

Else: 

src2 >> Icountl -► dsti 

11 src3 dst2 

MPYF3 

11 STF 

Multiply floating-point 

src1 X src2 dsti 

11 src3 -► dst2 

MPYI3 

II STI 

Multiply integer 

srcl X src2 dsti 

11 src3 dst2 

NEGF 

II STF 

Negate floating-point 

0- src2 -► dsti 

11 src3 “► dst2 


LEGEND; 

srcl - register addr (RO 
src3 - register addr (RO 
dsti ~ register addr (RO 


-R7) 

src2 - indirect 

-R7) 

src4 - indirect 

-R7) 

dst2 - indirect 


addr (disp = 0, 1, IRO, IR1) 
addr (disp = 0, 1, IRO, IR1) 
addr (disp = 0, 1, IRO, IR1) 
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Table 11-6. Parallel Instructions (Concluded) 


MNEMONIC 

DESCRIPTION 

OPERATION 

I PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS (Concluded) I 

NEGI 

I! STI 

Negate integer 

0 - src2 dsti 

11 src3 -► dst2 

NOT3 

II STI 

Complement 

dsti 

11 src3 -♦ dst2 

OR3 

II STI 

Bitwise logical-OR 

srcl OR src2 dsti 

11 src3 -*■ dst2 

STF 

II STF 

Store floating-point 

srcl -* dsti 

11 src3 dst2 

STI 

II STI 

Store integer 

srcl dsti 

II src3 dst2 

SUBF3 

II STF 

Subtract floating-point 

scrl - src2 -♦ dsti 

II src3 -♦ dst2 

SUBI3 

II STI 

Subtract integer 

srcl - src2 dsti 

II src3 -♦ dst2 

XOR3 

II STI 

Bitwise exclusive-OR 

srcl XOR src2 -♦ dsti 

II src3 dst2 

PARALLEL LOAD INSTRUCTIONS 

LDF 

II ldf 

Load floating-point 

src2 dsti 

11 src4 dst2 

LDI 

II LDI 

Load integer 

src2 dsti 

II src4 -► dst2 

PARALLEL MULTIPLY AND ADD/SUBTRACT INSTRUCTIONS | 

MPYF3 

I! ADDF3 

Multiply and add floating-point 

opi X op2 -► op3 

11 op4 + op5 op6 

MPYF3 

I! SUBF3 

Multiply and subtract floating-point 

opi X op2 -♦ op3 

II op4 - op5 -► op6 

MPYI3 

II ADDI3 

Multiply and add integer 

opi X op2 -♦ op3 

II op4 + op5 -♦ op6 

MPYI3 

II SUBI3 

Multiply and subtract integer 

opi X op2 -♦ op3 

11 op4 - op5 op6 


LEGEND: 

srcl “ register addr (R0-R7) 
src3 - register addr (R0-R7) 
dsti - register addr (R0-R7) 
op3 - register addr (RO or R1) 


src2 - indirect addr (disp = 0, 1, I RO, IR1) 

src4 - indirect addr (disp = 0, 1, I RO, IR1) 

dst2 - indirect addr (disp = 0, 1, I RO, IR1) 

op6 - register addr (R2 or R3) 


op1,op2,op4,op5 - Two of these operands must be specified using register addr, 
and two must be specified using indirect addr. 
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11.2 Condition Codes and Flags 

The TMS320C30 provides 20 condition codes that can be used with any of 
the conditional instructions, such as RETScoa7c/ or LD?cond. The conditions 
include signed and unsigned comparisons, comparisons to zero, and compar¬ 
isons based on the status of individual condition flags. Note that all condi¬ 
tional Instructions can accept the suffix 'U' to indicate unconditional 
operation. 

Seven condition flags provide Information related to properties of the result 
of arithmetic and logical instructions. The condition flags are stored In the 
status register (ST). These flags are modified by the majority of instructions 
according to whether a result is generated when performing the specified op¬ 
eration to infinite precision or an output Is written to the destination register. 
The formats for output values are shown in Table 11 -7. 


Table 11-7. Output Value Formats 


TyPE OF OPERATION 

OUTPUT FORMAT 

Floating-point 

Integer 

Logical 

8-bit exponent, 1 sign bit, 31-bit fraction 
32-bit integer 

32-bit unsigned integer 


The condition flags are affected by instructions In only the following cases: 

1) The destination register Is one of the extended-precision registers (RO - 
R7) 

2) The Instruction Is one of the compare instructions (CMPF, CMPF3, 
CMPI, CMPI3, TSTB, or TSTB3). 

Case 1 allows for modification of the registers used for addressing without 
affecting the condition flags during computation. Case 2 makes it possible to 
set the condition flags based upon the contents of any of the CPU registers. 

The following list defines the condition flags and describes how the flags are 
set by most instructions. For specific details of the effect of a particular In¬ 
struction on the condition flags, see the description of that instruction in 
Section 9.2. 

N Negative Condition Flag. Logical operations assign N the state 
of the MSB of the output value. For integer and floating-point oper¬ 
ations, N is set if the result is negative, and cleared otherwise. Zero 
is considered to be positive. 

Z Zero Condition Flag. For logical, integer, and floating-point oper¬ 
ations, Z is set If the output is 0, and cleared otherwise. 

V Overflow Condition Flag. For integer operations, V is set If the 
result does not fit into the format specified for the destination (I.e., 
-2^2 < result < 2^^ . -j j Otherwise, V is cleared. For floating-point 
operations, V is set if the exponent of the result is greater than 127, 
otherwlse,V is cleared. Logical operations always clear V. 

C Carry Flag. When an Integer addition is performed, C is set if a carry 
occurs out of the bit corresponding to the MSB of the output. When 
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an integer subtraction is performed, C is set if a borrow occurs into the 
bit corresponding to the MSB of the output. Otherwise, for integer 
operations, C is cleared. The carry flag is unaffected by floating-point 
and logical operations. 

UF Floating-Point Underflow Condition Flag. A floating-point 
underflow occurs whenever the exponent of the result is less than or 
equal to -128. If a floating-point underflow occurs, UF is set, and the 
output value Is set to 0. UF Is cleared If a floating-point underflow 
does not occur. 

LV Latched Overflow Condition Flag. LV is set whenever V (over¬ 
flow condition flag) is set. Otherwise, it is unchanged. LV may only 
be cleared by a processor reset or by modifying It in the status register 
(ST). 

LUF Latched Underflow Condition Flag. LUF is set whenever UF 
(floating-point underflow flag) Is set. LUF may only be cleared by a 
processor reset or by modifying it in the status register (ST). 

Table 11-8 lists the condition mnemonic, code, description, and flag for each 

of the 19 conditions. 
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Table 11-8. Condition Codes and Flags 


CONDITION 

CODE 

DESCRIPTION 

FLAG 

I UNCONDITIONAL COMPARES | 

I u 

00000 

Unconditional 

Don't care 

I UNSIGNED COMPARES | 

LO 

00001 

Lower than 

C 

LS 

00010 

Lower or same 

CORZ 

HI 

00011 

Higher than 

-C AND -Z 

HS 

00100 

Higher or same 

-C 

EQ 

00101 

Equal 

Z 

NE 

00110 

Not Equal 

-Z 

SIGNED COMPARES 

LT 

00111 

Less than 

N 

LE 

01000 

Less than or equal 

N OR Z 

GT 

01001 

Greater than 

-N AND -Z 

GE 

01010 

Greater than or equal 

~N 

EQ 

00101 

Equal 

Z 

NE 

00110 

Not equal 

-z 

I COMPARE TO ZERO | 

Z 

00101 

Zero 

z 

NZ 

00110 

Not zero 

~z 

P 

01001 

Positive 

-N AND -Z 

N 

00111 

Negative 

N 

NN 

01010 

Nonnegative 

-N 

I COMPARE TO CONDITION FLAGS I 

NN 

01010 

Nonnegative 

-N 

N 

00111 

Negative 

N 

NZ 

00110 

Nonzero 

-Z 

Z 

00101 

Zero 

z 

NV 

01100 

No overflow 

-V 

V 

01101 

Overflow 

V 

NUF 

OHIO 

No underflow 

-UF 

UF 

01111 

Underflow 

UF 

NC 

00100 

No carry 

-C 

C 

00001 

Carry 

C 

NLV 

10000 

No latched overflow 

-LV 

LV 

10001 

Latched overflow 

LV 

NLUF 

10010 

No latched floating-point underflow 

-LUF 

LUF 

10011 

Latched floating-point underflow 

LUF 

ZUF 

10100 

Zero or floating-point underflow 

Z OR UF 


^ Logical complement 
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11.3 Individual Instructions 

This section contains the individual assembly language Instructions for the 
TMS320C30. The instructions are listed in alphabetical order. Information, 
such as assembler syntax, operation, operands, encoding, description, cycles, 
status bits, mode bit, and examples. Is provided for each Instruction. An ex¬ 
ample Instruction precedes the individual Instruction listings to show the 
special format used and explain its content. 

Preceding the Individual instruction descriptions, the symbols and abbrevi¬ 
ations used in the individual instructions are defined. In addition, some op¬ 
tional syntax forms allowed by the assembler are described. 

A functional grouping of the instructions Is provided in Section 1.6. A com¬ 
plete instruction set summary can be found in Section 1.6.8. Appendix B lists 
the opcodes for all the Instructions. Refer to Section 6 for information on 
memory addressing. Code examples using many of the Instructions are given 
In Section 12, Software Applications. 

11.3.1 Symbols and Abbreviations 

Table 11-9 lists the symbols and abbreviations used in the Individual instruc¬ 
tion descriptions. 
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Table 11-9. Instruction Symbols 


SYMBOL 

MEANING 

src 

Source operand 

srcl 

Source operand 1 

src2 

Source operand 2 

src3 

Source operand 3 

src4 

Source operand 4 

dst 

Destination operand 

dsti 

Destination operand 1 

dst2 

Destination operand 2 

disp 

Displacement 

cond 

Condition 

count 

Shift count 

G 

General addressing modes 

T 

Three-operand addressing modes 

P 

Parallel addressing modes 

B 

Conditional-branch addressing modes 

ARn 

Auxiliary register n 

IRn 

Index register n 

Rn 

Register address n 

RC 

Repeat count register 

RE 

Repeat end address register 

RS 

Repeat start address register 

ST 

Status register 

C 

Carry bit 

GIE 

Global Interrupt enable bit 

N 

Trap vector 

PC 

Program counter 

RM 

Repeat mode flag 

SP 

System stack pointer 

|x| 

Absolute value of x 

X y 

Assign the value of x to destination y 

'K^man) 

Mantissa field (sign + fraction) of x 

x(exp) 

Exponent field of x 

opi 


II op2 

Operation 1 performed in parallel with operation 2 

X AND y 

Bitwise logical-AND of x and y 

X OR y 

Bitwise logIcal-OR of x and y 

xXORy 

Bitwise logical-XOR of x and y 

^x 

Bitwise logical-complement of x 

X << y 

Shift X to the left y bits 

X >> y 

Shift X to the right y bits 

* + +SP 

Increment SP and use incremented SP as address 

*SP- 

Use SP as address and decrement SP 




Assembly Language Instructions - Individual Instructions 


11.3.2 Optional Assembler Syntaxes 

The assembler allows a relaxed syntax form for some of the instructions. 
These optional forms simplify the assembly language so that special-case 
syntax can be ignored for some of the instructions. The following is a list of 
these optional syntax forms. 


• The destination register can be omitted on unary arithmetic and logical 
operations when the same register is used as a source. For example, 

ABSi RO,RO can be written as absi ro 

Instructions affected: ABSI, ABSF, FIX, FLOAT, NEGB, NEGF, NEGI, 
NORM, NOT, RND. 

• All 3-operand instructions can be written without the '3'. For example, 

ADDI3 R0,R1,R2 Can be written as addi ro,ri,R2 

Instructions affected: ADDC3, ADDF3, ADDI3, AND3, ANDN3, ASH3, 
LSH3, MPYF3, MPYI3, OR3, SUBB3, SUBF3, SUBI3, XOR3. 

This also applies to all the pertinent parallel instructions. 

• All 3-operand comparison instructions can be written without the '3'. 
For example, 

CMPIS RO,*ARO can be written as cmpi ro,*aro 

Instructions affected: CMPI3, CMPF3, TSTB3. 


• Indirect operands with an explicit 0 displacement are allowed. In 
3-operand or parallel instructions, operands with 0 displacement are 
automatically converted to "no-displacement" mode. For example: 

LDI *+AR0{0),Rl 
is legal 
Also 

ADDI3 *+AR0(0),R1,R2 is equivalent to ADDI3 *AR0,R1,R2 


Indirect operands can be written with no displacement. In which case a 
displacement of one is assumed. For example, 

LDI *AR0++(1),R0 can be written LDI *ARO++,RO 

All conditional instructions accept the suffix 'U' to indicate uncondi¬ 
tional operation. Also, the U can be omitted from unconditional short 
branch instructions. For example: 
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BU label 


can be written 


B label 


• Labels can be written with or without a trailing colon. For example: 


labelO: 

NOP 

labell 

NOP 

label2: 

NOP 


11-13 
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• Empty expressions are not allowed for the displacement in indirect 
mode: 

LDi *+ARO() ,R0 is not legal 

• Long immediate mode operands (destination of BR and CALL) can be 
written with an at-sign: 

BR label can be written BR @label 

• The LDP pseudo-op can be used to load a register (usually DP) with the 
8 MSBs of a relocatable address. The instruction is written: 

LDP addr,REG or LDP @addr,REG 

The at-sign Is optional. 

If the destination REG is the DP, It can be omitted. LDP generates a LDI 
instruction with an immediate operand, and a special relocation type. 

• Parallel instructions can be written in either order. For example: 

ADDi can be written as sti 

I 1 STI I I ADDI 

• The parallel bars indicating part 2 of a parallel instruction can be written 
anywhere on the line, from column 0 to the mnemonic. For example: 

ADDI can be written as addi 

1 1 STI I 1 STI 

• If the second operand of a parallel instruction Is the same as the third 
(destination register) operand, the third operand can be omitted. This 
allows the writing of 3-operand parallel instructions that 'look like' nor¬ 
mal 2-operand instruction. For example, 

addi *aro,R2,r2 can be written as ADDI *AR0,R2 

I I MPYI *AR1,R0,R0 1 1 MPYI *AR1,R0 

Instructions (applies to all parallel instructions that have a register sec¬ 
ond operand) affected: ADDI, ADDF, AND, MPYI, MPYF, OR, SUBI, 
SUBF, XOR. 

• All commutive operations In parallel instructions can be written in either 
order. For example, the ADDI part of a parallel instruction can be written 
in either of two ways: 

addi *AR0,R1,R2 or addi R1,*AR0,R2 

The instructions affected are parallel Instructions containing any of the 
following: ADDI, ADDF, MPYI, MPYF, AND, OR, XOR. 
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Assembly Language Instructions - Individual Instructions 


11.3.3 Individual Instruction Descriptions 

Each assembly language instruction for the TMS320C30 is described in this 
section. The instructions are listed in alphabetical order. An example instruc¬ 
tion precedes the individual Instructions to show the special format used and 
explain its content. This example instruction describes the assembler syntax, 
operation, operands, encoding, description, cycles, status bits, mode bit, and 
examples. 
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EXAMPLE 


Example Instruction 


Syntax INST <src>,<dst> 

or 

INST1 <src2>,<dst1> 

II INST2 <src3>,<dst2> 

Each instruction begins with an assembler syntax expression. Labels may 
be placed either preceding the command (instruction mnemonic) on the 
same line or on the preceding line in the first column. The optional com¬ 
ment field that concludes the syntax is not included in the syntax ex¬ 
pression. Space(s) are required between each field (label, command, 
operand, and comment fields). 

The syntax examples Illustrate the common one-line syntax and the two- 
line syntax used in parallel addressing. Note that the two vertical bars || that 
indicate a parallel addressing pair can be placed anywhere before the mne¬ 
monic on the second line. The first Instruction in the pair can have a label, 
but the second instruction cannot have a label. 

Operation \src\ dst 

or 

\src2\ dsti 
11 src3 dst2 

The Instruction operation sequence describes the processing that takes 
place when the instruction is executed. For parallel Instructions, the opera¬ 
tion sequence Is performed In parallel. Conditional effects of status register 
specified modes will be listed for conditional instructions such as Bcond. 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 s n ^ 27) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n < 27) 
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or 

src2 indirect (disp = 0, 1, IRQ, IR1) 
dsti register (Rn1,0 ^ n1 7) 

src3 register (Rn2, 0 ^ n2 7) 

dst2 indirect (disp = 0, 1, IRQ, IR1) 

Operands are defined according to the addressing mode and/or the type of 
addressing used. Note that Indirect addressing uses displacements and the 
index registers. Refer to Section 6 for detailed information on addressing. 
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Example Instruction 


EXAMPLE 


Encoding 


31 

24 23 


1615 

87 

0 



1 

G 

I 1 1 1 

dst 

I 1 1 1 1 1 1 I 1 1 1 1 1 I 1 

src 


or 

31 24 23 1615 8 7 0 


1 ' 1 

—1—1—1—1— 
INS1IIINS2 

IQI 

— \ —1— 
0 0 0 

mmm 

—1—1—1—1— \ —1—1— 
dst2 

—1—\—1—1— \ —1—1— 
src2 


Encoding examples are shown using general addressing and parallel ad¬ 
dressing. The instruction pair for the parallel addressing example consists 
of INS1 and INS2. 


Description Instruction execution and its effect on the rest of the processor or memory 
contents are described. Any constraints on the operands imposed by the 
processor or the assembler are discussed. The description parallels and 
supplements the Information given by the operation block. 

Cycles 1 


The digit specifies the number of cycles required to execute the Instruction. 


Status Bits N 

Z 

V 

c 


Negative Condition Flag. 1 if a negative result is generated, 0 
otherwise. In some Instructions, this flag is the MSB of the output. 
For other instructions, this flag is unaffected. 

Zero Condition Flag. 1 if a zero result is generated, 0 otherwise. 
For logical and shift instructions, 1 if a zero output is generated, 0 
otherwise. This flag may be unaffected. 

Overflow Condition Flag. 1 If an integer or floating-point over¬ 
flow occurs, 0 otherwise. This flag may be unaffected. 

Carry Flag. 1 If a carry or borrow occurs, 0 otherwise. For shift 
instructions, this flag is set to the value of the last bit shifted out; 0 
for a shift count of 0. This flag may be unaffected. 


UF 

LV 


Floating-Point Underflow Condition Flag. If a floating-point 
underflow occurs, 0 otherwise. This flag may be unaffected. 

Latched Overflow Condition Flag. 1 if an integer or floating¬ 
point overflow occurs, unchanged otherwise. This flag may be un¬ 
affected. 
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LUF Latched Floating-Point Underflow Condition Flag. 1 If a 

floating-point underflow occurs, unchanged otherwise. This flag 
may be unaffected. 


The seven condition flags, stored in the status register (ST), are modified 
by the majority of instructions. They provide information as to the properties 
of the result or output of arithmetic or logical operations. 

Mode Bit OVM Overflow Mode Flag. In general, integer operations are affected 
by the OVM flag. 
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EXAMPLE 


Example Instruction 


Example 


INST @98AEh,R5 

Before instruction: 

DP = 80h 

R5 = 0766900000h 2.30562500e+02 

Memory at 8098AEh = BCDFh = 1.00001107e+00 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = 0066900000h = 1.80126953e+00 
Memory at 8098AEh = BCDFh = 1.00001107e+00 
LUFLVUFNZVC = 0000000 

The sample code presented in the above format shows the effect of the 
code on system pointers (e.g., DP or SP), registers (e.g., R1 or RB), mem¬ 
ory at specific locations, and the seven status bits. The values given for the 
registers include the leading zeros to show the exponent in floating-point 
operations. Decimal conversions are provided for ail register and memory 
locations. The seven status bits are listed In the order in which they appear 
in the assembler and simulator (see Table 11-9 and Section 11.2 for further 
Information on these seven status bits). 



Absolute Value of Floating-Point 


ABSF 


Syntax ABSF <src>,<dst> 

Operation dst 

Operands src general addressing modes (G): 

0 0 register ( Rn, 0 ^ n 7) 
01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 

Encoding 



Description The absolute value of the src operand is loaded into the dst register. The 
src and dst operands are assumed to be floating-point numbers. 

An overflow occurs If src (man) = BOOOOOOOh and src (exp) = 7Fh. The 
result is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 

Cycles 1 

Status Bits N 0 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example ABSF R4, R7 

Before Instruction: 

R4 = 05C8000F971h = -9.90337307e+27 
R7 = 07D251100AEh = 5.48527255e+37 
LUFLVUFNZVC = 0000000 

After instruction: 

R4 = 05C8000F971h = -9.90337307e+27 
R7 = 05C7FFF068Fh = 9.90337307e+27 
LUFLVUFNZVC = 0000000 
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ABSFIISTF 


Parallel ABSF and STF 


Syntax ABSF <src2>,<dst1> 

II STF <src3>,<dst2> 

Operation \src2\ dsti 

11 src3 dst2 

Operands src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn1,0 n1 7) 
src3 register (Rn2, 0 ^ x\2 ^1) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description A floating-point absolute value and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (ABSF) writes 
to the same register, then STF accepts as input the contents of the register 
before it is modified by the ABSF. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

An overflow occurs if src (man) = BOOOOOOOh and src (exp) = 7Fh. The 
result is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 

Cycles 1 

Status Bits N 0 

Z 1 If a zero result Is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 
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Parallel ABSF and STF 


ABSFIISTF 


Example 


ABSF *++AR3(IRl),R4 
I I STF R4,*-AR7(1) 

Before Instruction: 

AR3 = 809800h 
IR1 = OAFh 

R4 = 733C00000h = 1.79750e+02 
AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6.118750e+01 
Data at 8098C4h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 8098AFh 
IR1 = OAFh 

R4 = 574C00000h = 6.118750e-H01 
AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6.118750e+01 
Data at 8098C4h = 733C000h = 1.79750e+02 
LUF LVUFNZVC = 0000000 



ABSI 


Absolute Value of Integer 


Syntax ABSI <src>,<dst> 

Operation l^rcl -♦ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n < 27) 

Encoding 

31 24 23 1615 8 7 0 



—I—I—I—I—I— 
0 0 0 0 0 1 

—r~ 
G 

—1—I—I—1— 

dst 

—I—1—1—1— \ —1—1—1—1—1—1—1— \ —1—1— 

src 


Description The absolute value of the src operand is loaded into the dst register. The 
src and dst operands are assumed to be signed integers. 

An overflow occurs If src = BOOOOOOOh. If ST(OVM) = 1, the result is dst 
= 7FFFFFFFh. If ST(OVM) = 0, the result Is dst = OOOOOOOOh. 

Cycles 1 

Status Bits N 0 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 
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Mode Bit 


OVM Operation affected by OVM. 










Absolute Value of Integer 


ABSI 


Example 


Example 


ABSI R0,R0 
or ABSI RO 

Before Instruction: 

RO = OFFFFFFCBh = -53 

After Instruction: 

RO = 035h = 53 

ABSI *AR1,R3 

Before Instruction: 

AR1 = 20h 
R3 = Oh 

Data at 20h = OFFFFFFCBh = -53 

After Instruction: 

AR1 = 20h 
R3 = 35h = 53 

Data at 20h = OFFFFFFCBh = -53 



ABSIIISTI 


Parallel ABSI and STI 


Syntax ABSI <src2>,<dst1> 

II STI <src3>,<dst2> 

Operation \src2\ dst1 

11 src3 -► dst2 


Operands 


Encoding 


src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn1,0 < n1 < 7) 
src3 register (Rn2, 0 < n2 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


31 2423 1615 87 0 


gg 

QQimQi 


□QQ 

—r—T"— 

src3 

—I—I—I—I—I—I—I— 

dst2 

—I—I—I—I— \ —I—I— 

src2 


Description An integer absolute value and an integer store are performed in parallel. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (ABSI) writes to the 
same register, then STI accepts as input the contents of the register before 
it Is modified by the ABSI. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 
Status Bits 


An overflow occurs if src = SOOOOOOOh. If ST(OVM) = 1, the result is dst 
= 7FFFFFFFh. If ST(OVM) = 0, the result is dst = OOOOOOOOh. 

1 

N 0 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an Integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 


Mode Bit OVM Operation affected by OVM. 
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Parallel ABSI and STI 


ABSIIISTI 


Example 


ABSI *“AR5(1),R5 
I I STI R1,*AR2—(IRl) 

Before Instruction: 

ARB = 8099E2h 
R5 = Oh 
R1 = 42h = 66 
AR2 = 8098FFh 
IR1 = OFh 

Data at 8099E1 h = OFFFFFFCBh = -53 
Data at 8098FFh = 2h = 2 
LUFLVUFNZVC = 0000000 

After Instruction: 

ARB = 8099E2h 
R5 = 35h = 53 
R1 = 42h = 66 
AR2 = 8098F0h 
IR1 = OFh 

Data at 8099E1 h = OFFFFFFCBh = -53 
Data at 8098FFh = 42h = 66 
LUFLVUFNZVC = 0000000 
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ADDC 


Add Integer with Carry 


Syntax ADDC <src>,<dst> 

Operation dst + src + Q ^ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 


31 


24 23 


1615 

87 

0 


lilfM 


B 

I 1 1 1 

dst 

I - 1 - 1 r 

0 

—1— \ —1—1—1—1—1—r 

src 

1 1 ' ' 1 


Description The sum of the dst and src operands and the C (carry) flag is loaded into 
the dst register. The dst and src operands are assumed to be signed inte¬ 
gers. 

Cycles 1 


Status Bits 


N 1 if a negative result Is generated, 0 otherwise. 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 If an integer overflow occurs, 0 otherwise. 

C 1 If a carry occurs, 0 otherwise. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit OVM Operation affected by OVM. 


Example 
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ADDC R1,R5 

Before instruction: 

R1 = 00FFFF5C25h = -41,947 
R5 = 00FFFF019Eh = -65,122 
LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

R1 = 00FFFF5C25h = -41,947 

R5 = 00FFFE5DC4h = -107,068 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 
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Add Integer With Carry, 3-Operand 


ADDC3 


Syntax ADDC3 <src2> <src1>,<dst> 

Operation srcl + src2 + 0 dst 

Operands srcl three-operand addressing modes (T): 

0 0 register ( Rn1,0 < n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rn1,0 < n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 < n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n ^ 27) 

Encoding 



Description The sum of the srcl and src2 operands and the C (carry) flag is loaded into 
the dst register. The src1, src2, and dst operands are assumed to be signed 
integers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a carry occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer o/erflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 
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ADDC3 


Add Integer With Carry, 3-Operand 


Example 


Example 


ADDC3 *AR5++(IRO),R5,R2 
or 

ADDC3 R5,*AR5++(IRO),R2 

Before Instruction: 

ARB = 809908h 
IRO =10h 
R5 = 066h = 102 
R2 = Oh 

Data at 809908h = OFFFFFFCBh = -53 
LUF LVUFNZVC = 0000001 

After Instruction: 

ARB = 809918h 
IRO = lOh 
R5 = 066h = 102 
R2 = 032h = 50 

Data at 809908h = OFFFFFFCBh = -53 
LUF LV UF N Z V C = 0 0 0 0 0 0 1 

ADDC3 R2, R7, RO 

Before Instruction: 

R2 = 02BCh = 700 
R7 = 0F82h = 3970 
RO = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

R2 = 02BCh = 700 
R7 = 0F82h = 3970 
RO = 0123Fh = 4671 

LUFLVUFNZVC = 0000000 
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Add Floating-Point 


ADDF 


Syntax ADDF <src>,<dst> 

Operation dst + src dst 

Operands src general addressing modes (G): 

0 0 register ( Rn, 0 ^ n ^ 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 


Encoding 


31 

24 23 


1615 

87 

0 

iltllii 

miQQQQ 

1 

G 

1 — 1 1 1 

dst 

- 1 - 1 1 1 - 1 1 - 1 I 1 1 ; 1 - 1 I 1 - 

src 


Description The sum of the dst and src operands is loaded Into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 
LUF 1 if a floating-point underflow occurs, unchanged otherwise. 


Mode Bit OVM Operation not affected by OVM. 


Example ADDF *AR4+-h (IRI) ,r5 

Before Instruction: 

AR4 = 809800h 
IR1 =12Bh 

R5 = 0579800000h = 6.23750e+01 

Data at 80992Bh = 86B2800h = 4.7031250e+02 

LUF LVUFNZVC = 0000000 

After Instruction: 

AR4 = 80992Bh 
IR1 =12Bh 

R5 = 09052C0000h = 5.3268750e+02 

Data at 80992Bh = 86B2800h = 4.7031250e+02 

LUFLVUFNZVC = 0000000 
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ADDF3 


Add Floating-Point, 3-Operand 


Syntax ADDF3 <src2>,<src1>,<dst> 

Operation srcl + src2 dst 

Operands srcl three-operand addressing modes (T): 

0 0 register ( Rn1,0 ^ n1 < 7) 

0 1 indirect (disp = 0, 1, IRO, IR1) 

1 0 register ( Rn1, 0 ^ n1 < 7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 

0 0 register (Rn2, 0 < n2 < 7) 

0 1 register (Rn2, 0 < n2 < 7) 

1 0 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n < 7) 

Encoding 

31 24 23 1615 8 7 0 



Description The sum of the src1 and src2 operands is loaded into the dst register. The 
srcl, src2, and dst operands are assumed to be floating-point numbers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 

Example ADDF3 R6,R5,R1 

or 

ADDF3 R5,R6,R1 
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Before Instruction: 

R6 = 086B280000h = 4.7031250e+02 
R5 = 0579800000h = 6.23750e+01 
R1 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R6 = 086B280000h = 4.7031250e+02 
R5 = 0579800000h = 6.23750e+01 
R1 = 09052C0000h - 5.3268750e+02 
LUFLVUFNZVC = 0000000 
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Add Floating-Point, 3-Operand 


ADDF3 


Example 


ADDF3 *+ARl(l),*AR7++(IR0),R4 

Before Instruction: 

AR1 = 809820h 
AR7 = 8099F0h 
IRQ = 8h 
R4 = Oh 

Data at 809821 h = 700F000h = 1.28940e+02 
Data at 8099F0h = 34C2000h = 1.27590e+01 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809820h 
AR7 = 8099F8h 
IRO = 8h 

R4 = 070DB20000h = 1.41695313e+02 
Data at 809821 h = 700F000h = 1.28940e+02 
Data at 8099F0h = 34C2000h = 1.27590e+01 
LUFLVUFNZVC = 0000000 



ADDF3IISTF 


Parallel ADDF3 and STF 


Syntax ADDF3 <src2>,<src1>,<dst1> 

II STF <src3>,<dst2> 

Operation srcl + src2 dsti 

11 src3 -► dst2 


Operands srcl register (Rn1,0 ^ n1 ^7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn2, 0 s n2 £ 7) 
src3 register (Rn3, 0 n3 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 24 23 1615 8 7 0 


gg 

gggi 

B 

—r“"n— 
dsti 

mmm 


— 1 — 1 — 1 — 1 — 1 — 1 — 1 — 
dst2 

—1 — 1 — 1 — 1 — 1 — 1 — 1 — 
src2 


Description 


Cycles 
Status Bits 


Mode Bit 


A floating-point addition and a floating-point store are performed in paral¬ 
lel. All registers are read at the beginning and loaded at the end of the ex¬ 
ecute cycle. This means that If one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (ADDF3) 
writes to the same register, then STF accepts as input the contents of the 
register before it Is modified by the ADDF3. 

If src2 and dst2 point to the same location, src2 Is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 
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Parallel ADDF3 and STF 


ADDF3IISTF 


Example 


ADDF3 *+AR3(IRl),R2,R5 
I I STF R4,*AR2 

Before Instruction: 

AR3 = 809800h 
IR1 = 0A5h 

R2 = 070C800000h = 1.4050e+02 
R5 = Oh 

R4 = 057B400000h = 6.2812506+01 
AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1.79750e+02 
Data at 8098F3h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 809800h 
IR1 = 0A5h 

R2 = 070C800000h = 1.4050e+02 
R5 = 0820200000h = 3.20250e+02 
R4 = 057B400000h = 6.281250e+01 
AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1.79750e+02 
Data at 8098F3h = 57B4000h = 6.28125e+01 
LUFLVUFNZVC = 0000000 




ADDI 


Add Integer 


Syntax ADDI <src>,<dst> 

Operation dst + src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 n 27) 

Encoding 


31 

24 23 


1615 

87 

0 


QQQQQQ 

u 

1 1 1—1— 

dst 

1 1 1—1 1 1 1—1 1 1 1—1 1 1—1 

src 


Description 

Cycles 
Status Bits 


Mode Bit 
Example 


11 


The sum of the dst and src operands is loaded into the the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a carry occurs, 0 otherwise. 

UF 0 

LV 1 If an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 

ADDI R3,R7 

Before instruction: 

R3 = OFFFFFFCBh = -53 
R7 = 35h = 53 

LUFLVUFNZVC = 0000000 
After Instruction: 


R3 = OFFFFFFCBh =-53 
R7 = Oh 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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Add Integer, 3-Operand 


ADDIS 


Syntax ADDIS <src2>,<src1>,<clst> 

Operation srcl + src2 dst 

Operands srcl three-operand addressing modes (T); 

0 0 register (Rn1,0 < n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRQ, IR1) 
1 0 register (Rnl, 0 < n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 < n2 ^ 27) 

0 1 register (Rn2, 0 n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRQ, IR1) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

dst register (Rn, 0 ^ n < 27) 

Encoding 



Description The sum of the srcl and src2 operands is loaded Into the dst register. The 
srcl, src2, and dst operands are assumed to be signed Integers. 

Cycles 1 

Status Bits N 1 if a negative result Is generated, 0 otherwise. 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if an Integer overflow occurs, 0 otherwise. 

C 1 If a carry occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 
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ADDIS 


Add Integer, 3-Operand 


Example ADDI3 R4,R7,R5 

Before Instruction: 

R4 = ODCh = 220 
R7 = OAOh = 160 
R5 = lOh = 16 

LUFLVUFNZVC = 0000000 

After instruction: 

R4 = ODCh = 220 
R7 = OAOh = 160 
R5 = 017Ch = 380 

LUFLVUFNZVC = 0000000 
Example ADDIS *-AR3+(l) ,*AR6—(IRO) ,R2 

Before Instruction: 

AR3 = 809802h 
AR6 = 809930h 
IRO =18h 
R2 = lOh = 16 

Data at 809801 h = 2AF8h = 11,000 
Data at 809930h = 3A98h = 15,000 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 809852h 
AR6 = 809918h 
IRO =18h 

R2 = 06598h = 26,000 
Data at 809801 h = 2AF8h = 11,000 
Data at 809930h = 3A98h = 15,000 
LUFLVUFNZVC = 0000000 
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Parallel ADDIS and STI 


ADDI3IISTI 


Syntax ADDI3 <src2> <src1>,<dst1> 

II STI <src3><dst2> 

Operation srcl + src2 -► dsti 

11 src3 -► dst2 

Operands srcl register (Rn1,0 < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn2, 0 n2 < 7) 
src3 register (Rn3, 0 < n3 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description An integer addition and an integer store are performed in parallel. All reg¬ 
isters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (ADDIS) writes to the same 
register, then STI accepts as input the contents of the register before it is 
modified by the ADDIS. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a carry occurs, 0 otherwise. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 

Example ADDI3 *AR0—(IR0),R5,R0 

I I STI R3,*AR7 

Before Instruction: 

ARO = 80992Ch 

IRQ = OCh 

R5 = ODCh = 220 

RO = Oh 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = Oh 

LUF LVUFNZVC^OOOOOOO 
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ADDI3IISTI 


Parallel ADDIS and STI 


After Instruction: 

ARO = 809920h 

IRQ = OCh 

R5 = ODCh = 220 

RO = 208h = 520 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = 35h = 53 

LUFLVUFNZVC = 0000000 
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Bitwise Logical-AND 


AND 


Syntax AND <src>,<dst> 

Operation dst kHD src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n ^ 27) 

0 1 direct 
1 0 indirect 

1 1 immediate (not sign-extended) 
dst register (Rn, 0 < n ^ 27) 

Encoding 



Description The bitwise logical-AND between the dst and src operands is loaded into 
the dst register. The dst and src operands are assumed to be unsigned in¬ 
tegers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


Example AND R1, R2 

Before Instruction: 

R1 = 80h 
R2 = OAFFh 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After instruction: 

R1 = 80h 
R2 = 80h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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AND3 


Bitwise Logical-AND, 3-Operand 


Syntax AND <src2>,<src1><dst> 

Operation srcl AND src2 dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 < n1 ^ 27) 

0 1 indirect (disp = 0, 1,IR0, IR1) 
1 0 register (Rnl, 0 < n1 ^ 27) 

1 1 indirect (disp = 0,1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 < 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 



Description The bitwise logical-AND between the srcl and src2 operands is loaded into 
the dst register. The srcl, src2, and dst operands are assumed to be un¬ 
signed integers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 
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Bitwise Logical-AND, 3-Operand 


AND3 


Example 


Example 


AND3 *ARO—(IRO),*+ARl,R4 

Before instruction: 

ARO = 8098F4h 
IRO = 50h 
AR1 = 809951 h 
R4 = Oh 

Data at 8098A4h = 30h 
Data at 809952h = 123h 
LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 8098A4h 
IRO = 50h 
AR1 = 809951 h 
R4 = 020h 

Data at 8098A4h = 30h 
Data at 809952h = 123h 
LUFLVUFNZVC = 0000000 

AND3 *-AR5,R7,R4 

Before Instruction: 

AR5 = 80985Ch 
R7 = 2h 
R4 = Oh 

Data at 80985Bh = OAFFh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 80985Ch 
R7 = 2h 
R4 = 2h 

Data at 80985Bh = OAFFh 
LUFLVUFNZVC = 0000000 
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AND3IISTI 


Parallel AND3 and STI 


Syntax 

AND <src2> <src1 >,<dst1 > 

II STI <src3>,<dst2> 

Operation 

src1 AND src2 dst1 

11 src3 dst2 


Operands 

srcl register (Rnl, 0 < 

src2 Indirect (disp = 0, 

dsti register (Rn2, 0 < 

src3 register (Rn3, 0 ^ 

dst2 Indirect (disp = 0, 

n1 s 7) 

1, IRO, IR1) 
n2 s 7) 
n3 s 7) 

1, IRO, IR1) 

Encoding 



31 

24 23 

1615 


1 1 1 


—1—1— 

dsti 

mmrn 

IQI 

1 1 1 1 1 1 1 

dst2 

— ^ —,— 1 — 1 —,— 1 — 1 — 
src2 


Description A bitwise logical-AND and an Integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed In parallel (AND3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the AND3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
c/st2. 


Cycles 1 


Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


11 
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Parallel AND3 and STI 


AND3IISTI 


Example 


AND3 *+ARl(IRO),R4,R7 
I I STI R3,*AR2 

Before Instruction: 

AR1 = 8099F1h 

IRQ = 8h 

R4 = 0A323h 

R7 = Oh 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 8099F1 h 

IRO = 8h 

R4 = 0A323h 

R7 = 03h 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = 35h = 53 

LUFLVUFNZVC = 0000000 




ANDN 


Bitwise Logical-AND with Complement 


Syntax ANDN <src>,<dst> 

Operation c/st AND -src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 

01 direct 
1 0 indirect 

1 1 immediate (not sign-extended) 
dst register (Rn, 0 ^ n ^ 27) 

Encoding 



Description The bitwise logical-AND between the dst operand and the bitwise logical 
complement C'^) of the src operand is loaded into the dst register. The dst 
and src operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example ANDN @980Ch,R2 

Before Instruction: 

DP = 80h 
R2 = 0C2Fh 

Data at 80980Ch = 0A02h 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 
R2 = 04yDh 

Data at 80980Ch = 0A02h 
LUFLVUFNZVC = 0000000 
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Bitwise Logical-ANDN, 3-Operand 


ANDN3 


Syntax AN DNS <src2> <src1> <dst> 

Operation srcl AND '^src2 -► dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 ^ n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRQ, IR1) 
1 0 register (Rn1,0 < n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 < n2 27) 

0 1 register (Rn2, 0 ^ n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRQ, IR1) 

1 1 indirect (disp = 0, 1, lOO, IR1) 

dst register (Rn, 0 ^ n < 27) 

Encoding 


31 24 23 1615 8 7 



0 



Description The bitwise logical-AND between the srcl operand and the bitwise logical 
complement (~) of the src2 operand is loaded Into the dst register. The 
srcl, src2, and dst operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


Example ANDN3 R5,R3,R7 

Before Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = Oh 

LUFLVUFNZVC = 0000000 



After Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 042Dh 

LUFLVUFNZVC = 0000000 
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ANDN3 


Bitwise Logical-ANDN, 3-Operand 


Example 


ANDN3 R1,*AR5++(IR0),R0 

Before Instruction: 

R1 = OCFh 
ARB = 809825h 
IRQ = 5h 
RO = Oh 

Data at 809825h = OFFFh 
LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = OCFh 
ARB = 80982Ah 
IRO = Bh 
RO = 0F30h 

Data at 80982Bh = OFFFh 
LUFLVUFNZVC-0000000 




Arithmetic Shift 


ASH 


Syntax ASH <count> <dst> 

Operation If {count ^ 0): 

dst < < count dst 
Else: 

dst >> \count\ -► dst 


Operands count general addressing modes (G): 

0 0 register (Rn, 0 ^ n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (RN, 0 ^ n ^ 27) 

Encoding 


31 

24 23 


1615 

87 

0 


— 1 1 1 — 1 — 1 — 

0 0 0 1 1 1 

1 

G 

I 1 I 1 

dst 

1 1 1 — 1 1 1 1 1 1 1 I 1 1 1 1 

count 


Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 

C ^ dst *- 0 


Cycles 
Status Bits 


Mode Bit 


If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out 
through the C (carry) bit 

Arithmetic right-shift 

sign of dst C 

If the count operand is zero, no shift is performed, and the C (carry) bit is 
set to 0. The count and dst operands are assumed to be signed Integers. 


11 


N MSB of the output 

Z 1 if a zero output is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Set to the value of the last bit shifted out 0 for a shift count of 0. 

Unaffected if dst Is not RO - R7. 

UF 0 

LV 1 If an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 
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ASH 


Arithmetic Shift 


Example 


Example 


ASH R1,R3 

Before Instruction: 

R1 = I0h = 16 
R3 = OAEOOOh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = lOh 

R3 = OEOOOOOOOOh 

LUF LVUFNZVC = 01 01 01 0 
ASH @98C3h,R5 

Before Instruction: 

DP = 80h 

R5 = 0AEC00001 h 

Data at 8098C3h = 0FFE8 = -24 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = OFFFFFFAEh 

Data at 8098C3h = 0FFE8 = -24 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 




Arithmetic Shift, 3-Operand 


ASH3 


Syntax ASH3 <count> <src>,<dst> 

Operation If {count ^ 0): 

src < < count dst 
Else: 

src >> \count\ dst 


Operands count three-operand addressing modes (T): 

0 0 register (Rn1,0 < n1 ^ 27) 

0 1 direct (disp = 0, 1JR0JR1) 

1 0 register (Rn1,0 ^ n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src three-operand addressing modes (T): 

0 0 register (Rn2, 0 ^ n2 < 27) 

0 1 register (Rn2, 0 < n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, lOO, IRl) 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 


31 2423 1615 87 0 



QQQQQIIIII 

B 


mom 



Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the src operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 

0 ^ src ^ 0 

If the count operand is less than zero, the src operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the src operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out 
through the C (carry) bit. 

Arithmetic right-shift: 

sign of dst -► C 

If the count operand Is zero, no shift is performed, and the C (carry) bit Is 
set to 0. The count, src, and dst operands are assumed to be signed inte¬ 
gers. 


Cycles 


1 








ASH3 


Arithmetic Shift, 3-Operand 


Status Bits 


Mode Bit 
Example 


Example 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 

Unaffected if dst is not RO - R7. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ASH3 *AR3"-(1),R5,R0 

Before Instruction: 

AR3 = 809921 h 
R5 = 02B0h 
RO = Oh 

Data at 809921 h = lOh = 16 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 809920h 

R5 = 000002 BOh 

RO = 02B00000h 

Data at 809921 h = lOh = 16 

LUFLVUFNZVC = 0000000 

ASH3 R1,R3,R5 

Before Instruction: 

R1 = 0FFFFFFF8h = -8 
R3 = OFFFFCBOOh 
R5 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0FFFFFFF8h = -8 
R3 = OFFFFCBOOh 
R5 = OFFFFFFCBh 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 



Parallel ASH3 and STI 


ASH3IISTI 


Syntax ASH3 <count> <src2>,<dst1> 

II STI <src3>,<dst2> 

Operation {count ^Q): 

src2 < < count -► dsti 
Else: 

src2 >> \count\ dsti 
11 src3 dst2 


Operands count register (Rn1,0 ^ n1 ^7) 
src2 indirect (disp = 0, IRO, IR1) 
dsti register (Rn2, 0 ^ n2 ^ 7) 
src3 register (Rn3, 0 < n3 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 24 23 1615 8 7 0 


DD 

QHQQII 

wmm 

I 1 

count 

—1 — 1 — 
src3 

—1 — 1 — 1 — 1 — 1 — 1 — 1 — 
dst2 

— 1 — 1 — 1 — 1 — 1 — 1 — I — 

src2 


Description The seven least-significant bits of the count operand register are used to 
generate the two's-complement shift count of up to 32 bits. 

If the count operand Is greater than zero, the dst operand Is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 

C ^ src2 ^ 0 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are sign-extended as It is right-shifted. Low-order bits are shifted out 
through the C (carry) bit. 

Arithmetic right-shift: 

sign of src2 -► C 

If the count operand Is zero, no shift Is performed, and the C (carry) bit is 
set to 0. The count and dst operands are assumed to be signed integers. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (ASH3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the ASH3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Cycles 1 


11 
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ASH3IISTI 


Parallel ASH3 and STI 


Status Bits 


Mode Bit 
Example 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 
UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ASH3 R1,*AR6++(IR1),R0 
I I STI R5,*AR2 

Before Instruction: 

AR6 = 809900h 

IRl = 8Ch 

R1 = 0FFE8h = -24 

RO = Oh 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR6 = 80998Ch 

IR1 = 8Ch 

R1 = 0FFE8h = -24 

RO = OFFFFFFAEh 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = 35h = 53 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 



Branch Conditionally (Standard) 


Bcond 


Syntax Bcond <src> 

Operation If cond is true: 

If src is in register addressing mode (Rn 0^n^27), 
src PC. 

If src is in PC-relative mode (label or address), 
displacement + PC + 1 PC. 

Else, continue. 

Operands src conditional-branch addressing modes (B): 

0 register 
1 PC-relative 

Encoding 

31 24 23 1615 8 7 0 


cond 


register or displacement 


Description Bcond signifies a standard branch that executes in four cycles. A branch 
is performed if the condition is true. If the src operand is expressed in reg¬ 
ister addressing mode, the contents of the specified register are loaded into 
the PC. If the src operand is expressed in PC-relative mode, the assembler 
generates a displacement: displacement = label - (PC of branch instruction 
+ 1). This displacement is stored as a 16 bit signed integer in the 16 least 
significant bits of the branch instruction word. This displacement is added 
to the PC of the branch instruction plus 1 to generate the new PC. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 

Cycles 4 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example BZ RO 

Before instruction: 

PC = 2B00h 
RO = 0003FF00h 

LUFLVUFNZVC = 0000000 

After instruction: 

PC = 3FF00h 
RO = 0003FF00h 

LUFLVUFNZVC = 0000000 







BcondD 


Branch Conditionally (Delayed) 


11 


Syntax 

Operation 


Operands 


BcondD <src> 

If cond is true: 

If src is in register addressing mode (Rn 0:^n<27), 
src PC. 

If src is in PC-relative mode (label or address), 
displacement + PC + 3 -► PC. 

Else, continue. 

src conditional-branch addressing modes (B): 

0 register 
1 PC-relative 


Encoding 


31 


24 23 


1615 

87 

0 

QummQ 

B 

IMWIii 

B 

1 1 1 1 

cond 

I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

register or displacement 


Description BcondD signifies a delayed branch that allows the three Instructions after 
the delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

A branch is performed if the condition Is true. If the src operand is ex¬ 
pressed In register addressing mode, the contents of the specified register 
are loaded Into the PC. If the src operand is expressed In PC-relative mode, 
the assembler generates a displacement: displacement = label - (PC of 
branch instruction + 3). This displacement Is stored as a 16 bit signed in¬ 
teger in the 16 least significant bits of the branch instruction. This dis¬ 
placement is added to the PC of the branch Instruction plus 3 to generate 
the new PC. The TMS320C30 provides 20 condition codes that can be 
used with this instruction (see Section 11.2 for a list of condition mne¬ 
monics, encoding, and flags). 


Cycles 1 


Status Bits 


N 

Unaffected. 

Z 

Unaffected. 

V 

Unaffected. 

c 

Unaffected. 

UF 

Unaffected. 

LV 

Unaffected. 

LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example BNZD 36 (36 = 24h) 

Before Instruction: 

PC = 50h 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC = 77h 

LUFLVUFNZVC = 0000000 
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Branch Unconditionally (Standard) 


BR 


Syntax 

Operation 

Operands 

Encoding 

31 


BR <src> 
src PC 

src long-immediate addressing mode 

24 23 1615 


87 


0 


0 1 


1 0 0 0 0 


0 


src 


Description BR signifies a standard branch that executes in four cycles. An uncondi¬ 
tional branch is performed. The src operand is assumed to be a 24-bit un¬ 
signed integer. Note that bit 24 = 0 for a standard branch. 


Cycles 4 


Status Bits 


N 

Unaffected. 

Z 

Unaffected. 

V 

Unaffected. 

c 

Unaffected. 

UF 

Unaffected. 

LV 

Unaffected. 

LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example BR 805Ch 

Before instruction: 

PC = 80h 

LUFLVUFNZVC = 0000000 
After instruction: 

PC = 805Ch 

LUFLVUFNZVC^OOOOOOO 
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BRD 


Branch Unconditionally (Delayed) 


11 


Syntax 

Operation 

Operands 

Encoding 

31 


BRD <src> 
src PC 

src long-immediate addressing mode 


24 23 


—t—I—I—I—I—r 

0 1 1 0 0 0 Oil 


1615 

n—I—I—r—r 


87 


-!—I—I—r 

src 


1 —\—I —I—I—r 


Description BRD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

An unconditional branch is performed. The src operand is assumed to be a 
24-bit unsigned Integer. Note that bit 24 = 1 for a delayed branch. 


Cycles 1 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example BRD 2Ch 

Before instruction: 

PC = 1Bh 

LUFLVUFNZVC = 0000000 
After Instruction: 

PC = 2Ch 

LUFLVUFNZVC = 0000000 
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Call Subroutine 


CALL 


Syntax CALL <src> 

Operation Next PC -► * ++SP 

src -► PC 

Operands src long-immediate addressing mode 

Encoding 


31 


24 23 


1615 


87 


0 


0 1 1 0 0 0 1 


0 


src 


Description A call is performed. The next PC value is pushed onto the system stack. The 
src operand is loaded into the PC. The src operand is assumed to be a 
24-bit unsigned immediate operand. 

Cycles 4 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example CALL 123456h 

Before Instruction: 


PC = 5h 
SP = 809801 h 

LUFLVUFNZVC = 0000000 


After Instruction: 

PC = 123456h 
SP = 809802h 
Data at 809802h = 6h 
LUFLVUFNZVC = 0000000 


11 
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CMJ-cond 


Call Subroutine Conditionally 


Syntax 

Operation 


Operands 


CiKLLcond <src> 

If cond is true; 

Next PC * ++SP 

If src is in register addressing mode (Rn 0<n^27), 
src PC. 

If src is in PC-relative mode (label or address), 
displacement + PC + 1 PC. 

Else, continue. 

src conditional-branch addressing modes (B): 

0 register 
1 PC-relative 


Encoding 


31 


24 23 

1615 

87 

0 

Bwmiiitii 

B 

□QOg 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

register or displacement 


Description A call is performed if the condition Is true. If the condition Is true, the next 
PC value Is pushed onto the system stack. If the src operand Is expressed 
In register addressing mode, the contents of the specified register are loaded 
into the PC. If the src operand Is expressed in PC-relative mode, the as¬ 
sembler generates a displacement: displacement = label - (PC of call In¬ 
struction + 1). This displacement is stored as a 16-bit signed Integer in the 
16 least significant bit of the call instruction word. This displacement is 
added to the PC of the call instruction plus 1 to generate the new PC. 

The TI\/IS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 

Cycles 5 


11 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UP 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 

Mode Bit 

OVM 

Operation not affected by OVM. 
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Call Subroutine Conditionally 


CALLcond 


Example 


CALLNZ R5 

Before Instruction: 

PC =123h 
SP = 809835h 
R5 = 789h 

LUFLVUFNZVC = 0000000 
After Instruction: 

PC = 789h 
SP = 809836h 
R5 = 789h 

Data at 809836h = 124h 
LUFLVUFNZVC = 0000000 



CMPF 


Compare Floating-Point 


Syntax CMPF <src>,<dst> 

Operation dst - src 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 7) 

01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 


Encoding 


31 

24 23 


1615 

87 

0 

1 1 

0 0 0 

QQQQQQ 

{' 1 1 

G 

1 1 1 1 

dst 

1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 

src 


Description The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and 
src operands are assumed to be floating-point numbers. 


Cycles 1 


Status Bits 


N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 If a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 
LUF 1 If a floating-point underflow occurs, unchanged otherwise. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


CMPF *+AR4,R6 

Before Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1.4050e+02 

Data at 8098F3h = 070C8000h = 1.4050e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1.4050e+02 

Data at 8098F3h = 070C8000h = 1.4050e+02 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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Compare Floating-Point, 3-Operand 


CMPF3 


Syntax CMPF3 <src2><src1> 

Operation srcl - src2 


Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 < nl < 7) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rnl, 0 ^ n1 ^7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 ^ 7) 

0 1 register (Rn2, 0 ^ n2 ^ 7) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 


Encoding 


31 24 23 1615 8 7 0 


—1—1 — 
0 0 1 

—1— \ —1—I—I— 
0 0 0 1 1 0 

T 

—1—1—1—1— 
0 0 0 0 0 

— r“"T — 1 —1—1—1—1 — 
srcl 

—1—1—1— \ —1—1— 1 — 
src2 


Description The src2 operand is subtracted from the srcl operand. The result is not 
loaded into any register, thus allowing for nondestructive compares. The 
srcl and src2 operands are assumed to be floating-point numbers. Al¬ 
though this instruction has only two operands, it is designated as a three 
operand instruction since operands are specified in the three operand for¬ 
mat. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a flaoting-point overflow occurs, unchanged otherwise. 
LUF 1 if a floating-point underflow occurs, unchanged otherwise. 


Mode Bit OVM Operation not affected by OVM. 

Example CMPF3 *AR2 , *AR3 — (1) 

Before instruction: 

AR2 = 809831 h 
AR3 = 809852h 

Data at 809831 h = 77A7000h = 2.5044e+02 
Data at 809852h = 57A2000h = 6.253125e+01 
LUFLVUFNZVC = 0000000 


11 


After Instruction: 

AR2 = 809831 h 
AR3 = 809851 h 

Data at 809831 h = 77A7000h = 2.5044e+02 
Data at 809852h = 57A2000h = 6.253125e+01 
LUF LV UF N Z V C = 0 0 0 1 0 0 0 
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CMPI 


Compare Integer 


Syntax CMPI <src><dst> 

Operation dst - src 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 


Encoding 


31 

24 23 


1615 

87 

0 

gog 

HUDI 

B 

1 1 1 1 

dst 

I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 

src 


Description The src operand is subtracted from the dst operand. The result Is not loaded 
Into any register, thus allowing for nondestructive compares. The dst and 
src operands are assumed to be signed integers. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


CMPI R3,R7 

Before instruction: 

R3 = 898h = 2200 
R7 = 3E8h = 1000 

LUFLVUFNZVC = 0000000 

After Instruction: 

R3 = 898h = 2200 
R7 = 3E8h = 1000 

LUF LVUFNZVC = 0001 000 
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Compare Integer, 3-Operand 


CMPI3 


Syntax CMPI3 <src2> <src1> 

Operation srcl - src2 


Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 ^ n1 < 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rn1,0 ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T); 
0 0 register (Rn2, 0 < n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 


Encoding 


31 24 23 1615 8 7 0 


liliSH 


B 

QQQQHH 




Description 


Cycles 
Status Bits 


The src2 operand is subtracted from the src1 operand. The result is not 
loaded Into any register, thus allowing for nondestructive compares. The 
src1 and src2 operands are assumed to be signed integers. Although this 
instruction has only two operands, it is designated as a three operand in¬ 
struction since operands are specified in the three operand format. 

1 

N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example CMPI3 R7 ,R4 

Before Instruction: 

R7 = 03E8h =1000 
R4 = 0898h = 2200 

LUF LVUFNZVC = 0000000 


11 


After Instruction: 

R7 = 03E8h =1000 
R4 = 0898h = 2200 

LUFLVUFNZVC = 0000000 
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DBcond 


Decrement and Branch Conditionally (Standard) 


11 


Syntax D^cond <kHn>,<src> 

Operation ARn - 1 ARn 

If cond is true and ARn ^ 0 : 

If src is in register addressing mode (Rn 0<n:^27) 
src -* PC. 

If src is in PC-relative mode (label or address) 
displacement + PC + 1 PC. 

Else, continue. 


Operands src conditional-branch addressing modes (B): 
0 register 
1 PC-relative 

ARn register (0 ^ n ^ 7) 


Encoding 


31 


24 23 


1615 

87 

0 

QQHQim 

B 

I 1 

ARn 

D 

IIIIIIQQIIIIIII^ 

—1 I 1 1 1 — 1 1 1 1 1 1 — 1 1 1 1 

register or displacement 


Description DBco/?^/signifies a standard branch that executes in four cycles. The spe¬ 
cified auxiliary register is decremented and a branch Is performed if the 
condition Is true and the specified auxiliary register is greater than or equal 
to zero. 

The auxiliary register is treated as a 24-bit signed integer. The most-signi¬ 
ficant eight bits are unmodified by the decrement operation. The compar¬ 
ison of the auxiliary register uses only the 24 least-significant bits of the 
auxiliary register. Note that the branch condition does not depend on the 
auxiliary register decrement. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded Into the PC. If the src operand is expressed 
In PC-relative addressing mode, the assembler generates a displacement: 
displacement = label - (PC of branch instruction + 1). This integer Is 
stored as a 16 bit signed integer in the 16 least significant bits of the branch 
instruction word. This displacement is added to the PC of the branch in¬ 
struction plus 1 to generate the new PC. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 

Cycles 4 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 

Mode Bit 

OVM 

Operation not affected by OVM. 
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Decrement and Branch Conditionally (Standard) 


DBcond 


Example 


DBLT AR3,R2 

Before Instruction: 

PC = 5Fh 
AR3 = 12h 
R2 = 9Fh 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 
After Instruction: 

PC = 9Fh 
AR3 =11h 
R2 = 9Fh 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 


11 
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DBco/ic/D 


Decrement and Branch Conditionally (Delayed) 


11 


Syntax DBcondD <M{n>,<src> 

Operation ARn - 1 ARn 

If cond is true: 

If src is in register addressing mode (Rn 0:^n<27) 
src PC 

If src Is in PC-relative mode (label or address) 
displacement + PC + 3 PC. 

Else, continue. 


Operands src conditional-branch addressing modes (B): 
0 register 
1 PC-relative 

ARn register (0 ^ n ^ 7) 


Encoding 


31 


24 23 


1615 

87 

0 


p 

mmm 

p 

■in 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

register or displacement 


Description DBcondD signifies a delayed branch that allows the three Instructions after 
the delayed branch to be fetched before the PC Is modified. The effect Is a 
single-cycle branch. The specified auxiliary register is decremented and a 
branch Is performed If the condition is true and the specified auxiliary reg¬ 
ister greater than or equal to zero. 

The auxiliary register Is treated as a 24-bit signed integer. The most-signi¬ 
ficant eight bits are unmodified by the decrement operation. The compar¬ 
ison of the auxiliary register uses only the 24 least-significant bits of the 
auxiliary register. Note that the branch condition does not depend on the 
auxiliary register decrement. 

If the src operand Is expressed In register addressing mode, the contents of 
the specified register are loaded into the PC. If the src is expressed in 
PC-relative addressing, the assembler generates a displacement: displace¬ 
ment = label - (PC of branch instruction + 3). This displacement is added 
to the PC of the branch Instruction plus 3 to generate the new PC. Note 
that bit 21 = 1 for a delayed branch. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 

Cycles 1 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 

Mode Bit 

OVM 

Operation not affected by OVM. 
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Decrement and Branch Conditionally (Delayed) 


DBco/idD 


Example 


DBZD AR5,$+110h 

Before Instruction: 

PC = Oh 
AR5 = 67h 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 

After Instruction: 

PC = 11 Oh 
AR5 = 66h 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 



FIX 


Floating-Point to Integer Conversion 


Syntax FIX <src>,<dst> 

Operation fix(src) -♦ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n 7) 

01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 27) 

Encoding 



Description The floating-point operand src is converted to the nearest integer less than 
or equal to it in absolute value, and the result is loaded Into the dst register. 
The src operand is assumed to be a floating-point number and the dst op¬ 
erand a signed integer. 

The exponent field of the result register (If it has one) is not modified. 

Integer overflow occurs when the floating-point number is too large to be 
represented as a 32-bit two's-complement Integer. In the case of integer 
overflow, the result will be saturated in the direction of overflow. 

Cycles 1 

Status Bits N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 If an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example FIX R1,R2 


11 


Before Instruction: 

R1 = 0A282CCCCCh = -1.3454e+3 
R2 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0A282CCCCCh = -13454e+3 
R2 = 541 h = 1345 

LUF LVUFNZVC = 0000000 
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Parallel FIX and STI 


FIXIISTI 


Syntax FIX <src2> <dst1> 

II STI <src3><dst2> 

Operation i\x{src2) dst1 

11 src3 dst2 


Operands 


src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn1,0 ^ n1 < 7) 
src3 register (Rn2, 0 ^ n2 7) 
dst2 indirect (disp = 0, 1, IRO, 1R1) 


Encoding 


31 24 23 1615 8 7 0 


1' 1 

QQOmQ 



— 1 —1— 
src3 

—1—1—1—1— r—T — 1 — 

dst2 



Description A floating-point to integer conversion is performed. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means 
that if one of the parallel operations (STI) reads from a register, and the 
operation being performed in parallel (FIX) writes to the same register, then 
STI accepts as Input the contents of the register before it Is modified by FIX. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 
Status Bits 


Mode Bit 


Integer overflow occurs when the floating-point number is too large to be 
represented as a 32-bit two's-complement Integer. In the case of Integer 
overflow, the result will be saturated in the direction of overflow. 

1 

N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an Integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 


11 
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FIXIISTI 


Parallel FIX and STI 


Example 


FIX *++AR4(l),Rl 
II STI R0,*AR2 

Before Instruction: 

AR4 = 8098A2h 
R1 = Oh 

RO = ODCh = 220 
AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1.79506+02 
Data at 80983Ch = Oh 
LUFLVUFNZVC = 0000000 

After instruction: 

AR4 = 8098A3h 
R1 - 0B3h = 179 
RO - ODCh = 220 
AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1.79750e+02 
Data at 80983Ch = ODCh = 220 
LUFLVUFNZVC = 0000000 



Integer to Floating-Point Conversion 


FLOAT 


Syntax FLOAT <src>,<c/st> 

Operation float(src) dst 


Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n < 7) 


Encoding 


31 

24 23 


1615 

87 

0 

1 1 

0 0 0 

— 1 — 1 — 1 — 1 — 1 — 

0 0 10 11 

1 

G 

1 1 , , 

dst 

1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 

src 


Description The integer operand src is converted to the floating-point value equal to it, 
and the result loaded into the dst register. The src operand is assumed to 
be a signed integer, and the dst operand a floating-point number. 


Cycles 1 


Status Bits 


N 1 if a negative result Is generated, 0 otherwise. 
Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


11 


11-71 










FLOAT 


Integer to Floating-Point Conversion 


Example 


FLOAT *+-fAR2 (2) ,R5 

Before instruction: 

AR2 = 809800h 

R5 = 034C2000h = 1.27578125e+01 
Data at 809802h = OAEh = 174 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 809802h 

R5 = 072E00000h = 1.74e+02 

Data at 809802h = OAEh = 174 

LUF LVUFNZVC = 0000000 



Parallel FLOAT and STF 


FLOATIISTF 


Syntax FLOAT <src2> <dst1> 

II STF <src3><dst2> 

Operation float(5/'c2) dst1 

11 src3 dst2 

Operands src2 indirect (disp = 0, 1, IRO, IR1) 
dst1 register (Rn1,0 < n1 ^7) 
src3 register (Rn2, 0 ^ n2 7) 
dst2 register (disp = 0, IRO, IR1) 

Encoding 

31 24 23 1615 8 7 0 


—1—1—1—1—' 

0 10 11 

-1-1— 

dsti 

—1—1— 
0 0 0 

—1-1— 

src3 

—1—1—1—1—1—1—1— 
dst2 



Description An integer to floating-point conversion is performed. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means that 
if one of the parallel operations (STF) reads from a register and the opera¬ 
tion being performed in parallel (FLOAT) writes to the same register, then 
STF accepts as input the contents of the register before it is modified by 
FLOAT. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Cycles 1 

Status Bits IM 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


Example FLOAT *+AR2 ( IRO) ,R6 

II STF R7,*AR1 

Before Instruction: 

AR2 = 8098C5h 
IRO = 8h 
R6 = Oh 

R7 = 034C200000h = 1.27578125e+01 
AR1 = 809933h 

Data at 8098CDh = OAEh = 174 
Data at 809933h = Oh 
LUFLVUFNZVC = 0000000 
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FLOATIISTF 


Parallel FLOAT and STF 


After Instruction: 

AR2 = 8098C5h 
IRQ = 8h 

R6 = 072E000000h = 1.740e+02 
R7 = 034C200000h = 1.27578125e+01 
AR1 = 809933h 

Data at 8098CDh = OAEh = 174 

Data at 809933h = 034C2000h = 1.27578125e+01 

LUFLVUFNZVC = 0000000 




Interrupt Acknowledge 


lACK 


Syntax 

Operation 

Operands 


Encoding 

31 



lACK <src> 

Perform a dummy read ope ration with lACK = 0. 
At end of dummy read, set lACK to 1. 

src general addressing modes (G): 

0 1 direct 
1 0 indirect 


24 23 


1615 


87 


0 



Description A dummy read operation is performed with lACK = 0. At the end of the 
dummy read, lACK is set to 1. This instruction can be used to generate an 
external interrupt acknowledge. If the address specified is off-chip, a read 
operation from that address is performed. The lACK signal and the address 
can then be used to signal Interrupt acknowledge to external devices. The 
data read by the processor is unused. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


Example lACK *AR5 

Before Instruction: 

lACK = 1 
PC = 300h 

LUFLVUFNZVC = 0000000 
After Instruction: 

lACK = 1 
PC = 301 h 

LUFLVUFNZVC = 0000000 
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IDLE 


Idle Until Interrupt 


Syntax 

IDLE 




Operation 

1 ST(GIE) 

Next PC PC 

Idle until interrupt. 




Operands 

None 




Encoding 





31 

24 23 

1615 

87 

0 



Description The global interrupt enable bit is set the next PC value is loaded into the 
PC, and the CPU idles until an interrupt is received. When the interrupt is 
received, the contents of the PC are pushed on the active system stack. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 
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Load Floating-Point Exponent 


LDE 


Syntax LDE <src>,<dst> 

Operation 5/'c(exp) c/5?(exp) 

Operands src general addressing modes (G); 

0 0 register (Rn, 0 ^ n ^ 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n < 7) 

Encoding 


31 


24 23 


1615 

87 

0 

liHilW 

o 

o 

1 1 1 

110 1 

B 

1 1 1 1 

dst 

1 1 1 — 1 1 1 1 1 1 1 1 1 1 1 1 

src 


Description The exponent field of the src operand is loaded into the exponent field of 
the dst register. No modification of the dst register mantissa field is made 
unless the value of the exponent loaded Is the reserved value of the expo¬ 
nent for zero in the precision of the src operand. Then the mantissa field 
of the dst register is set to zero. The src and dst operands are assumed to 
be floating-point numbers. 


Cycles 

1 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example LDE RO, R5 

Before Instruction: 

RO = 0200056F30h = 4.00066337e+00 
R5 = 0A056FE332h = 1.06749648e+03 
LUFLVUFNZVC = 0000000 

After Instruction: 

RO = 0200056F30h = 4.00066337e+00 
R5 = 02056FE332h = 4.16990814e+00 
LUFLVUFNZVC = 0000000 


11 
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LDF 


Load Floating-Point 


Syntax LDF <src>,<dst> 

Operation src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n < 7) 

Encoding 


31 

24 23 


1615 

87 

0 

liWIH 

QQmmQ 

1 ' 1 

G 


1 1 1 1 1 1 1 1 1 1 1 1 1 — 1 1 

src 


Description The src operand is loaded into the dst register. The dst and src operands 
are assumed to be floating-point numbers. 

Cycles 1 


Status Bits 

N 

1 if a negative result is generated, 0 otherwise. 


Z 

1 if a zero result is generated, 0 otherwise. 


V 

0 


c 

Unaffected. 


UF 

0 


LV 

Unaffected. 


LUF 

Unaffected. 

Mode Bit 

OVM Operation not affected by OVM. 


Example 


11 


LDF @9800h,R2 

Before Instruction: 

DP = 80h 
R2 = Oh 

Data at 809800h = 10C52A00h = 2.19254303e+00 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R2 = 010C52A000h = 2.19254303e+00 

Data at 809800h = 10C52A00h = 2.19254303e+00 

LUF LVUFNZVC = 0000000 


11-78 







Load Floating-Point Conditionally 


LDFcond 


Syntax LDFcond <src>,<dst> 

Operation If cond is true: 

src -> dst. 

Else: 

dst is unchanged. 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n ^ 7) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 7) 

Encoding 


31 2423 1615 87 0 


—1—1—1— 
0 10 0 

—1—1— \ —1— 
cond 

1 

G 

—1—1—1—1— 
dst 

—1—1—1—1—1— \ —1—1—1—1—1—1—1—1—1— 

src 


Description If the condition is true, the src operand is loaded into the dst register. 

Otherwise, the dst register is unchanged. The dst and src operands are 
assumed to be floating-point numbers. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). Note that an LDFU (load floating-point unconditionally) in¬ 
struction is useful for loading R0-R7 without affecting condition flags. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 
Example LDFZ R3, R5 

Before instruction: 

R3 = 2CFF2CD500h = 1.77055560e+13 
R5 = 5F0000003Eh = 3.96140824e+28 
LUF LVUFNZVC = 00001 00 



After Instruction: 

R3 = 2CFF2CD500h = 1.77055560e+13 
R5 = 2CFF2CD500h = 1.77055560e+13 
LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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LDFI 


Load Floating-Point, Interlocked 


Syntax LDFI <src>,<dst> 

Operation Signal interlocked operation. 

src dst 


Operands src general addressing modes (G): 
01 direct 
1 0 indirect 

dst register (Rn, 0 ^ n ^ 7) 

Encoding 


31 

24 23 


1615 

87 

0 

itilMlii 

□aomin 

Q 

I—I—I I— 

dst 

—I—I—I—I—I—I I I I I I I I I I 

src 


Description The src operand is loaded Into the dst register. An interlocked operation is 
signaled over XFO and XF1. The src and dst operands are assumed to be 
floating-point numbers. Note that only direct and Indirect modes are al¬ 
lowed. Refer to Section 7.3 for detailed description. 


Cycles 

1 If XF1 = 0 (see Section 7.3) 

Status Bits 

N 

1 If a negative result Is generated, 0 otherwise. 


Z 

1 if a zero result is generated, 0 otherwise. 


V 

0 


c 

Unaffected. 


UF 

0 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


LDFI *+AR2,R7 

Before Instruction: 

AR2 = 8098F1 h 
R7 = Oh 

Data at 8098F2h = 584C000h = -6.28125e+01 
LUF LVUFNZVC = 0000000 

After instruction: 

AR2 = 8098F1 h 

R7 = 0584C00000h = -6.28125e+01 

Data at 8098F2h = 584C000h = -6.28125e+01 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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Parallel LDF and LDF 


LDFIILDF 


Syntax 

Operation 

Operands 


Encoding 

31 


1 1 


LDF <src2> <dst2> 

II LDF <src1> <dst1> 

src2 -► dst2 
\\src1 -► dsti 

srcl indirect (disp = 0, 1, IRQ, IR1) 
dsti register (Rn1,0 < n1 ^7) 
src2 indirect (disp = 0, 1, IRQ, IR1) 
dst2 register (Rn2, 0 ^ n2 < 7) 


24 23 


1615 


87 


0 



Description Two floating-point loads are performed in parallel. If the LDFs load the 
same register, the assembler issues a warning. The result is that of LDF 
<src2> <dst2>. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


Example LDF *—AR1(IR0),R7 

II LDF *AR74-+(1) ,R3 

Before Instruction: 

AR1 = 80985Fh 
IRQ = 8h 
R7 = Oh 

AR7 = 80988Ah 
R3 = Oh 

Data at 809857h = 70C8000h = 1.4050e+02 
Data at 80988Ah = 57B4000h = 6.281250e+01 
LUFLVUFNZVC = 0000000 

After instruction: 

AR1 = 809857h 
IRO = 8h 

R7 = 070C800000h = 1.4050e+02 
AR7 = 80988Bh 

R3 = 057B400000h = 6.281250e+01 
Data at 809857h = 70C8000h = 1.4050e+02 
Data at 80988Ah = 57B4000h = 6.281250e+01 
LUFLVUFNZVC = 0000000 
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LDFIISTF 


Parallel LDF and STF 


Syntax LDF <src2>,<dst1> 

II STF <src3>,<dst2> 

Operation src2 dsti 

II src3 dst2 


Operands src2 indirect (disp = 0, 1, IRQ, IR1) 
dsti register (Rnl, 0 n1 ^7) 
src3 register (Rn2, 0 £ n2 ^ 7) 
dst2 indirect (disp = 0,1, IRQ, IR1) 

Encoding 


31 2423 1615 8 7 0 


m 

QomoQ 


□QQ 





Description A floating-point load and a floating-point store are performed in parallel. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 

1 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


LDF *AR2—(1),R1 
II STF R3,*AR4++(IR1) 

Before Instruction: 

AR2 = 8098E7h 
R1 = Oh 

R3 = 057B400000h = 6.28125e+01 
AR4 = 809900h 
IR1 =10h 

Data at 8098E7h = 70C8000h = 1.4050e-i-02 
Data at 809900h = Oh 
LUFLVUFNZVC = 0000000 
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Parallel LDF and STF 


LDFIISTF 


After Instruction: 

AR2 = 8098E6h 

R1 = 070C800000h = 1.4050e+02 
R3 = 057B400000h = 6.28125e+01 
AR4 = 80991 Oh 
IR1 =lOh 

Data at 8098E7h = 70C8000h = 1.4050e+02 
Data at 809900h = 57B4000h = 6.28125e+01 
LUFLVUFNZVC = 0000000 




LDI 


Load Integer 


Syntax LDI <src><dst> 

Operation src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n < 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 



Description The src operand is loaded into the dst register. The dst and src operands 
are assumed to be signed integers. An alternate form of LDI, LDP, is used 
to load the data page pointer register (DP), or any other register with the 
eight MSBs of a relocatable address. See Section 11.3.2. 

Cycles 1 

Status Bits N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example LDI *-ARl (IRQ) ,R5 

Before instruction: 

AR1 = 2Ch 
IRQ = 5h 

R5 = 3C5h = 965 
Data at 27h = 26h = 38 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 2Ch 

IRQ = 5h 

R5 = 26h = 38 

Data at 27h = 26h = 38 

LUFLVUFNZVC = 0000000 
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Load Integer Conditionally 


LD\cond 


Syntax LDlcond <src>,<c/st> 

Operation If cond is true: 

src -► dst, 

Else: 

dst is unchanged. 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 

31 24 23 1615 8 7 0 


cond 




src 


Description 


Cycles 
Status Bits 


Mode Bit 
Example 


If the condition Is true, the src operand Is loaded into the dst register. 
Otherwise, the dst register is unchanged. The dst and src operands are as¬ 
sumed to be signed integers. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). Note that an LDIU (load integer unconditionally) Instruction is 
useful for loading R0-R7 without affecting the condition flags. 

1 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

LDIZ R4,R6 

Before tnstruction: 

R4 = 027Ch = 636 
R6 = 0FE2h = 4,066 

LUF LVUFNZVC = 0000000 

After instruction: 

R4 = 027Ch = 636 
R6 = 0FE2h = 4,066 

LUFLVUFNZVC = 0000000 



11-85 









LDil 


Load Integer, Interlocked 


Syntax LDII <src>,<dst> 

Operation Signal interlocked operation. 

src -► dst 


Operands src general addressing modes (G); 
0 1 direct 
1 0 indirect 

dst register (Rn, 0 ^ n ^ 27) 


Encoding 


31 

24 23 


1615 

87 

0 

iggg 


I 

G 

! I 1 1 

dst 

— I I 1 — 1 — I — 1 I — 1 1 I I I 1 1 I 

src 


Description The src operand is loaded into the dst register. An interlocked operation is 
signaled over XFO and XF1. The src and dst operands are assumed to be 
signed integers. Note that only the direct and indirect modes are allowed. 
Refer to Section 7.3 for detailed description. 


Cycles 

1 if XF = 0 (see Section 7.3) 

Status Bits 

N 

1 if a negative result Is generated,0 otherwise. 


Z 

1 if a zero result Is generated, 0 otherwise. 


V 

0 


c 

Unaffected. 


UF 

0 


LV 

Unaffected 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


LDII @985Fh,R3 

Before instruction: 

DP = 80 
R3 = Oh 

Data at 80985Fh = ODCh 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80 
R3 = ODCh 

Data at 80985Fh = ODCh 
LUFLVUFNZVC = 0000000 
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Parallel LDI and LDI 


LDIIILDI 


Syntax LDI <src2>,<dst2> 

II LDI <src1> <dst1> 

Operation src2 dst2 

\\src1 -► dsti 


Operands srcl indirect (disp = 0, 1, IRO, IR1) 
dst7 register (Rn1,0 < n1 ^7) 
src2 indirect (disp = 0, 1, IRO, IR1) 
dst2 register (Rn2, 0 < n2 ^ 7) 

Encoding 


31 24 23 1615 8 7 0 


11 1 

— 1 — 1 —r- 
0 0 0 1 


— \ —1— 
dst2 

—1—1— 

dsti 


-1-1-1-1-1-1-1— 

srcl 

—1— \ —1—1—1—1—1— 
src2 


Description Two integer loads are performed in parallel. A warning is issued by the 
assembler if the LDIs load the same register. The result Is that of LDI 
<src2><dst2>. 


Cycles 
Status Bits 


1 

N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example LDI *-ARl(l),R7 

II LDI *AR74-+(IR0) ,R1 

Before instruction: 

AR1 = 809826h 
R7 = Oh 

AR7 = 8098C8h 
IRO =lOh 
R1 = Oh 

Data at 809825h = OFAh = 250 
Data at 8098C8h = 2EEh = 750 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809826h 

R7 = OFAh = 250 

AR7 = 8098D8h 

IRO =10h 

R1 = 02EEh = 750 

Data at 809825h = OFAh = 250 

Data at 8098C8h = 2EEh = 750 

LUFLVUFNZVC = 0000000 


11 
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LDillSTI 


Parallel LDI and STI 


Syntax LDI <src2><dst1> 

II STI <src3>,<dst2> 

Operation src2 -* dsti 

11 src3 dst2 


Operands src2 indirect (disp = 0, 1, IRO, IR1) 
dst^ register (Rn1,0 < n1 < 7) 
src3 register (Rn2, 0 ^ n2 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 2423 1615 8 7 0 


m 

omiQQ 

— I — I — 

dsti 

lilBlW 

-T-1- 

src3 

— 1 — 1 — 1 — 1 — 1 — 1 — 
dst2 

— 1 — 1 — 1 — 1 — 1 — 1 — 1 — 
src2 


Description An integer load and an integer store are performed in parallel. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 1 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit 


OVM Operation not affected by OVM. 


11 
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Parallel LDI and STI 


LDIIISTI 


Example 


LDI *-ARl(l),R2 
II STI R7,*AR5++(IR0) 

Before Instruction: 

AR1 = 8098E7h 
R2 = Oh 
R7 = 35h = 53 
AR5 = 80982Ch 
IRO = 8h 

Data at 8098E6h = ODCh = 220 
Data at 80982Ch = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 8098E7h 
R2 = ODCh = 220 
R7 = 35h = 53 
AR5 = 809834h 
IRO = 8h 

Data at 8098E6h = ODCh = 220 
Data at 80982Ch = 35h = 53 
LUFLVUFNZVC = 0000000 




LDM 


Load Floating-Point Mantissa 


Syntax LDM <src>,<cfst> 

Operation src(man) -♦ ^/s^(man) 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n ^ 7) 

01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 

Encoding 


31 

24 23 


1615 

87 

0 


QQQQQQI 

u 


I I I I I I I I I I I I I I ! 

src 


Description The mantissa field of the src operand is loaded into the mantissa field of the 
dst register. The dst exponent field Is not modified. The src and dst op¬ 
erands are assumed to be floating-point numbers. If immediate addressing 
mode Is used, bits 15 - 12 of the instruction word are forced to 0 by the 
assembler. 


Cycles 1 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


LDM 156.75 , R2 (156.75 = 071 CCOOOOOh) 

Before instruction: 

R2 = Oh 

LUFLVUFNZVC = 0000000 
After Instruction: 

R2 = 001 CCOOOOOh = 1.22460938e+00 
LUFLVUFNZVC = 0000000 
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Logical Shift 


LSH 


Syntax LSH <count>,<dst> 

Operation If count ^ 0; 

dst < < count -► dst 
Else: 

dst >> \count\ dst 


Operands src general addressing modes (G): 

0 0 register (Rn, 0 n < 27) 
01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 


31 

24 23 


1615 

87 

0 

BIHH 

QQomiQ 

D 

I I I I 

dst 

T-1 I I I-1 I I I I I I-1 I I 

count 


Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. If the count operand is greater than zero, 
the dst operand is left- shifted by the value of the count operand. Low- 
order bits shifted in are zero-filled, and high-order bits are shifted out 
through the C (carry) bit. 

Logical left-shift; 

C dst ^ 0 


If the count operand is less than zero, the dst Is right-shifted by the absolute 
value of the count operand. The high-order bits of the dst operand are 
zero-filled as shifted to the right. Low-order bits are shifted out through the 
C (carry) bit. 

Logical right-shift: 

0 -* dst C 


Cycles 


If the count operand is 0, no shift is performed and the C (carry) bit Is set 
to 0. The count operand Is assumed to be a signed integer and the dst 
operand Is assumed to be an unsigned Integer. 

1 


11 


Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 

Unaffected If dst is not R0-R7. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 
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LSH 


Logical Shift 


Example LSH R4, R7 

Before instruction: 

R4 = 018h = 24 
R7 = 02ACh 

LUFLVUFNZVC = 0000000 

After Instruction: 


R4 = 018h = 24 
R7 = OACOOOOOOh 

LUF LV UF N Z V C = 0 0 0 1 0 1 0 
Example LSH *-AR5 (IRl) ,R5 

Before Instruction: 

ARB = 809908h 
IRO = 4h 

R5 = 0012C00000h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LVUFNZVC = 0000000 

After Instruction: 

ARB = 809908h 
IRO = 4h 

RB = 0000012C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLVUFNZVC = 0000000 


11 
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Logical Shift, 3-Operand 


LSH3 


Syntax LSH3 <count>,<src>,<dst> 

Operation If count ^ 0: 

src < < count -* dst 
Else: 

src >> \count\ dst 


Operands src three-operand addressing modes (T); 

0 0 register (Rn1,0 ^ n < 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 

1 0 register (Rn1,0 < n1 < 26) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

count three-operand addressing modes (T): 
0 0 register (Rn2, 0 < n2 < 27) 

0 1 register (Rn2, 0 < n2 < 27) 

1 0 indirect (disp = 0, 1, IRQ, IR1) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

dst register (Rn, 0 < n < 27) 

Encoding 


31 24 23 1615 8 7 0 


ggH 

QQQQQQI 

—I— 

T 

—I—I—I—I— 

dst 

— I—I—I—I—I—I—I— 

src 

—^—,—I—,—,—I—,— 

count 


Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 

C src ^ 0 


If the count operand is less than zero, the src operand Is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are zero-filled as shifted to the right. Low-order bits are shifted out through 
the C (carry) bit. 

Logical right-shift: 

Q ^ src C 


11 


If the count operand is 0, no shift is performed and the C (carry) bit is set 
to 0. The count operand Is assumed to be a signed integer. The src and 
dst operands are assumed to be unsigned integers. 

Cycles 1 
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LSH3 


Logical Shift, 3-Operand 


Status Bits 


Mode Bit 
Example 


Example 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 
Unaffected if dst is not R0-R7. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

LSH3 R4,R7,R2 

Before Instruction: 

R4 = 018h = 24 
R7 = 02ACh 
R2 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 


R4 = 018h = 24 
R7 = 02ACh 
R2 = OACOOOOOOh 

LUF LVUFNZVC = 0001 01 0 
LSH3 *-AR4(IR1)R5,R3 

Before Instruction: 

AR4 = 809908h 
IR1 = 4h 

R5 = 012C00000h 
R3 = Oh 

Data at 809904h = 0FFFFFFF4h = -12 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 809908h 
IR1 = 4h 

R5 = 012C00000h 

R3 = 0000012C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LVUFNZVC = 0000000 



Parallel LSH3 and STI 


LSH3IISTI 


Syntax LSH3 <count> <src2> <dst1> 

II STI <src3>,<dst2> 

Operation If count ^ 0: 

src2 << count -► dsti 
Else: 

src2 >> \count\ -♦ dsti 
11 src3 dst2 


Operands count register (Rn1,0 n1 ^7) 
srcl indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn3, 0 < n3 < 7) 
src2 register (Rn4, 0 < n4 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 


24 23 


1615 


87 


0 


11 1 

—1—1—1—1— 
0 1110 

—1—1— 
dsti 

—1—1— 

count 


—1 —\ —1—1—1—1—1— 
dst2 

- \ -1-1-1-1-!-1- 

src2 


Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low- order bits shifted in are zero-filled 
and high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 

C dst2 0 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are zero filled as shifted to the right. Low-order bits are shifted out through 
the C (carry bit). 

Logical right-shift: 

0 dst2 C 


If the count operand is 0, no shift is performed and the carry bit is set to 0. 

The count operand is assumed to be a 7-bit signed integer and the src2 and 
dsti operands are assumed to be unsigned integers. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means 
that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (LSH3) writes to the same register, 
then STI accepts as Input the contents of the register before it is modified 
by the LSH3. 


11 


If src2 and dst2 point to the same location, src2 Is read before the write to 
dst2. 


Cycles 


1 











LSH3IISTI 


Parallel LSH3 and STI 


Status Bits 


Mode Bit 
Example 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the last bit shifted out. 0 for a shift count of 0. 
UF 0 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

LSH3 R2,*++AR3(1),R0 
I I STI R4,*-AR3 

Before Instruction: 


R2 = 18h = 24 
AR3 = 8098C2h 
RO = Oh 

R4 = ODCh = 220 

AR3 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 


R2 = 18h = 24 

AR3 = 8098C3h 

RO = OACOOOOOOh 

R4 = ODCh = 220 

AR3 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = ODCh = 220 

LUF LVUFNZVC = 0001 000 




Parallel LSH3 and STI 


LSH3IISTI 


Example 


LSH3 R7,*AR2—(1),R2 
I I STI R0,*-HAR0(1) 

Before Instruction: 

R7 = 0FFFFFFF4h = -12 
AR2 = 809863h 
R2 = Oh 

RO = 12Ch = 300 

ARO = 8098B7h 

Data at 809863h = 2C000000h 

Data at 8098B8h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R7 = 0FFFFFFF4h = -12 

AR2 = 809862h 

R2 = 2C000h 

RO = 12Ch = 300 

ARO = 8098B7h 

Data at 809863h = 2C000000h 

Data at 8098B8h = 12Ch = 300 

LUFLVUFNZVC = 0000000 



MPYF 


Multipy Floating-Point 


Syntax MPYF <src>,<dst> 

Operation dst x src dst 

Operands src general addressing modes (G); 

0 0 register (Rn, 0 < n < 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 

Encoding 


31 

24 23 


1615 

87 

0 

gpQ 

QQQQQQ 

D 

1—1 1 1 

dst 

1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 

src 


Description The product of the dst and src operands is loaded into the dst register. The 
src operand is assumed to be a single-precision floating-point number, and 
the dst operand is an extended-precision floating-point number. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if a floating-point is overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 If a floating-point underflow occurs, 0 otherwise. 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


MPYF R0,R2 

Before Instruction: 

RO = 070C800000h = 1.4050e+02 
R2 = 034C200000h = 1.27578125e+01 
LUFLVUFNZVC = 0000000 

After Instruction: 

RO = 070C800000h = 1.4050e+02 
R2 = 0A600F2000h = 1.79247266e+03 
LUFLVUFNZVC = 0000000 


11-98 






Multiply Floating-Point, 3-Operand 


MPYF3 


Syntax MPYF3 <src2>,<src1>,<dst> 

Operation srcl x src2 dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1, 0 ^ n1 < 7) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rnl, 0 < n1 < 7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 < 7) 

0 1 register (Rn2, 0 < n2 < 7) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n < 7) 

Encoding 



Description The product of the dst1 and src2 operands is loaded Into the dst register. 

The src1 and src2 operands are assumed to be single-precision floating¬ 
point numbers, and the dst operand Is an extended-precision floating-point 
number. 

Cycles 1 

Status Bits N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 


11 
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MPYF3 


Multiply Floating-Point, 3-Operand 


Example 


Example 


MPYF3 R0,R7,R1 

Before Instruction: 

RO = 057B400000h = 6.281250e+01 
R7 = 0733C00000h = 1.79750e+02 
R1 = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 

RO = 057B400000h = 6.281250e+01 
R7 = 0733C00000h = 1.79750e+02 
R1 = 0D306A3000h - 1.12905469e-f04 
LUFLVUFNZVC = 0000000 

MPYF3 *+AR2(IRO),R7,R2 
or 

MPYF3 R7,*+AR2(IRO),R2 

Before Instruction: 

AR2 = 809800h 
IRO =12Ah 

R7 = 057B400000h = 6.281250e+01 
R2 = Oh 

Data at 80992Ah = 70C8000h = 1.4050e+02 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 809800h 
IRO = 12Ah 

R7 = 057B400000h = 6.281250e+01 
R2 = 0D09E4A000h = 8.82515625e+03 
Data at 80992Ah = 70C8000h = 1.4050e+02 
LUF LVUFNZVC = 0000000 



Parallel MPYF3 and ADDF3 


IV/IPYF3IIADDF3 


Syntax MPYF3 <srcA><srcB>,<dst1> 

II ADDF3 <srcC>,<srcD>,<dst2> 

Operation srcA x srcB -► dsti 

11 srcC + srcD dst2 


Operands 

srcA 

srcB Any two indirect (disp = 0,1 ,IR0,IR1) 
srcC Any two register (0 < ARn < 7) 
srcD 


dsti 

register 
0 = RO 

idl): 



1 = R1 



dst2 

register 
0 = R2 

{d2): 



1 = R3 



srcl 

register 

(Rn, 0 < 

n < 7) 

src2 

register 

(Rn, 0 < 

n < 7) 

src3 

indirect 

(disp = 0, 

, 1, IRO, IR1) 

src4 

Indirect 

(disp = 0, 

, 1, IRO, IR1) 


P parallel addressing modes (0 < P < 3) 

OPERATION 



00 

src3 X src4, srcl + src2 


01 

src3 X srcl, src4 + src2 


10 

srcl X src2, src3 + src4 


11 

src3 X srcl, src2 + src4 

Encoding 

31 

24 23 

1615 


1 ' 0 

mil 

D 


s 



—1— \ —1—1— \ —1—1— 
src3 

—1—1—1—1—1—1—1— 
src4 


Description A floating-point multiplication and a floating-point addition are performed 
in parallel. All registers are read at the beginning and loaded at the end of 
the execute cycle. This means that if one of the parallel operations (MPYF3) 
reads from a register and the operation being performed in parallel 
(ADDF3) writes to the same register, then MPYF3 accepts as input the 
contents of the register before it is modified by the ADDF3. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg¬ 
ister. The assignment of the source operands srcA-srcD to the src1-src4 
fields varies depending on the combination of addressing modes used, and 
the P field Is encoded accordingly. The assembler may, when not signif¬ 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 










MPYF3IIADDF3 


Parallel MPYF3 and ADDF3 


Cycles 
Status Bits 


Mode Bit 
Example 


If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 0 
Z 0 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, 0 unchanged otherwise. 

LUF 1 If a floating-point underflow occurs, 0 unchanged otherwise. 

OVM Operation not affected by OVM. 

MPYF3 *AR5++(1),*--ARl(IRO),R0 
II ADDF3 R5,R7,R3 

Before instruction: 

AR5 = 8098C5h 
AR1 = 8098A8h 
IRQ = 4h 
RO = Oh 

R5 = 0733C00000h = 1.79750e+02 
R7 = 070C800000h = 1.4050e+02 
R3 = Oh 

Data at 8098C5h = 34C0000h = 1.2750e+01 
Data at 8098A4h = 1110000h = 2.2500e+00 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 8098C6h 
AR1 = 8098A4h 
IRO = 4h 

RO = 0467180000h = 2.88867188e+01 
R5 = 0733C00000h = 1.79750e+02 
R7 = 070C800000h = 1.4050e+02 
R3 = 0820200000h = 3.20250e+02 
Data at 8098C5h = 34C0000h = 1.2750e+01 
Data at 8098A4h = 111 OOOOh = 2.2500e+00 
LUFLVUFNZVC = 0000000 



Parallel MPYF3 and STF 


MPYF31ISTF 


Syntax MPYF3 <src2> <src1> <dst1> 

II STF <src3>,<dst2> 

Operation srcl x src2^ dsti 

11 src3 dst2 

Operands srcl register (Rn1,0 < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn3, 0 < n3 < 7) 
src3 register (Rn4, 0 ^ n4 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description A floating-point multiplication and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYF3) 
reads from a register and the operation being performed In parallel (STF) 
writes to the same register, then MPYFS accepts as input the contents of 
the register before It Is modified by the STF. 

If src2 and dst2 point to the same location, then src2 is read before the write 
to dst2. 

Cycles 1 

Status Bits N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 If a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 If a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, 0 unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, 0 unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 


11 
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MPYF3IISTF 


Parallel MPYF3 and STF 


Example 


MPYF3 *-AR2(l),R7,R0 
II STF R3,*AR0—(IRO) 

Before instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281250e+01 
RO = Oh 

R3 = 086B280000h = 4.7031250e+02 
ARO = 809860h 
IRO = 8h 

Data at 80982Ah = 70C8000h = 1.4050e+02 
Data at 809860h = Oh 
LUFLVUFNZVC = 0000000 

After instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281250e+01 
RO = 0D09E4A000h = 8.82515625e+03 
R3 == 086B280000h = 4.7031250e+02 
ARO = 809858h 
IRO = 8h 

Data at 80982Ah = 70C8000h = 1.4050e+02 
Data at 809860h = 86B280000h = 4.7031250e+02 
LUFLVUFNZVC = 0000000 




Parallel MPYF3 and SUBF3 


MPYF3IISUBF3 


Syntax MPYF3 <srcA> <srcB>,<dst1> 

II SUBF3 <srcC>,<srcD>,<dst2> 

Operation srcA x srcB -* dsti 

11 srcD “ srcC dst2 

Operands 

srcA 

srcB Any two indirect (disp = 0,1 ,IR0,IR1) 
srcC Any two register (0 < ARn < 7) 
srcD 


dsti register {d1)\ 

0 = RO 
1 = R1 

dst2 register i^d2)\ 

0 = R2 
1 = R3 

srcl register (Rn, 0 < n < 7) 

src2 register (Rn, 0 < n < 7) 

src3 indirect (disp = 0, 1, !R0, IR1) 

src4 indirect (disp = 0, 1, IRO, IR1) 


P parallel addressing modes (0 < P < 3) 

OPERATION 

00 src3 X src4, srcl - src2 

01 src3 X srcl, src4 - src2 

10 srcl X src2, src3 - src4 

11 src3 X srcl, src2 ~ src4 

Encoding 



11 
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MPYF3IISUBF3 


Parallel MPYF3 and SUBF3 


Description A floating-point multiplication and a floating-point subtraction are per¬ 
formed in parallel. All registers are read at the beginning and loaded at the 
end of the execute cycle. This means that if one of the parallel operations 
(MPYF3) reads from a register, and the operation being performed in par¬ 
allel (SUBF3) writes to the same register, then MPYF3 accepts as input the 
contents of the register before it is modified by the SUBF3. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as Indirect and two are reg¬ 
ister. The assignment of the source operands srcA-srcD to the src1-src4 
fields varies depending on the combination of addressing modes used, and 
the P field is encoded accordingly. The assembler may, when not signif¬ 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 

Cycles 1 

Status Bits N 0 

Z 0 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 

Example MPYF3 R5 , *++AR7 (IRl) ,R0 

I I SUBF3 R7,*AR3“-(1) ,R2 

or 

MPYF3 R5,*++AR7(IRl),R5,RO 
I 1 SUBF3 R7,*AR3—(1) ,R2 

Before Instruction: 

R5 = 034C000000h = 1.2750e+01 
AR7 = 809904h 
IR1 = 8h 
RO = Oh 

R7 = 0733C00000h = 1.79750e+02 
AR3 = 8098B2h 
R2 = Oh 

Data at 80990Ch = 1110000h = 2.250e+00 
Data at 8098B2h = 70C8000h = 1.4050e+02 
LUFLVUFNZVC = 0000000 
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Parallel MPYF3 and SUBF3 


MPYF3I1SUBF3 


After Instruction: 

R5 = 034C000000h = 1.2750e+01 
AR7 = 80990Ch 
IR1 = 8h 

RO = 0467180000h = 2.88867188e+01 
R7 = 0733C00000h = 1.79750e+02 
AR3 = 8098B1h 

R2 = 05E3000000h = -3.9250e+01 
Data at 80990Ch = IllOOOOh = 2.250e+00 
Data at 8098B2h = 70C8000h = 1.4050e+02 
LUFLVUFNZVC = 0000000 




MPYI 


Multiply Integer 


Syntax MPYI <src>,<dst> 

Operation dst x src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 n < 27) 

Encoding 

31 24 23 1615 8 7 0 


0 10 10 1 



src 


Description The product of the dst and src operands is loaded into the dst register. The 
src and dst operands when read are assumed to be 24-bit signed integers. 
The result Is assumed to be a 48-bit signed Integer. The output to the dst 
register is the 32 least-significant bits of the result. 

Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an Integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 

Mode Bit OVM Operation affected by OVM. 


Example 



MPYI R1,R5 

Before Instruction: 

R1 = 000033C251 h = 3,392,081 
R5 = 000078B600h = 7,910,912 
LUFLVUFNZVC = 0000000 


After Instruction: 

R1 = 000033C251 h = 3,392,081 

R5 = 00E21 D9600h = -501,377,536 

LUF LV UF N Z V C = 0 1 0 1 0 1 0 
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Multiply Integer, 3-Operand 


MPYI3 


Syntax MPYI3 <src2>,<src1> <dst> 

Operation srcl x src2 -► dst 

Operands srcl three-operand addressing modes (T): 

0 0 register {Rn1, < n1 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rnl, ^ n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, < n2 < 27) 

0 1 register (Rn2, < n2 < 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n < 27) 


Encoding 


31 24 23 1615 8 7 0 


illMH 

o 

o 

“T-1-1- 

0 1 0 

T 

—1—1—1—1— 
dst 

—1—1—1—1—1—1—1— 
srcl 

—I—1—1—1—1—1—1— 

src2 


Description The product of the srcl and src2 operands is loaded into the dst register. 

The srcl and src2 operands are assumed to be 24-blt signed integers. The 
result is assumed to be a signed 48-bit integer. The output to the dst reg¬ 
ister is the 32 least-significant bits of the result. 


Cycles 
Status Bits 


Mode Bit 


Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 
LUF Unchanged. 

OVM Operation affected by OVM. 


11 
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MPYta 


Multiply Integer, 3-Operand 


Example MPYI3 *AR4 , *-ARl (1) ,R2 

Before instruction: 

AR4 = 809850h 
AR1 = 8098F3h 
R2 = Oh 

Data at 809850h = OADh = 173 
Data at 8098F2h = ODCh = 220 
LUFLVUFNZVC = 0000000 

After instruction: 

AR4 = 809850h 

AR1 = 8098F3h 

R2 = 094ACh = 38,060 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUF LVUFNZVC = 0000000 

Example MPYI3 *—AR4 (IRO) ,R2 ,R7 

Before instruction: 

AR4 = 8099F8h 
IRO = 8h 

R2 = 0C8h = 200 
R7 = Oh 

Data at 8099F0h = 32h = 50 
LUFLVUFNZVC = 0000000 

After instruction: 

AR4 = 8099F0h 
IRO = 8h 

R2 = 0C8h = 200 
R7 = 0271 Oh = 10,000 
Data at 8099F0h = 32h = 50 
LUFLVUFNZVC = 0000000 


11 


11-110 



Parallel MPYI3 and ADDIS 


MPYI3IIADDI3 


Syntax MPYI3 <srcA> <srcB>,<dst1> 

II ADD13 <srcC>,<srcD>,<dst2> 

Operation srcA x srcB dsti 

11 srcD + srcC dst2 

Operands 

srcA 

srcB Any two Indirect (disp = 0,1,IR0,IR1) 
srcC Any two register (0 ^ ARn ^ 7) 
srcD 


dsti register (d1): 

0 = RO 
1 = R1 

dst2 register (d2)\ 

0 = R2 
1 = R3 

srcl register (Rn, 0 < n < 7) 

src2 register (Rn, 0 < n < 7) 

src3 Indirect (disp = 0, 1, IRO, IR1) 

src4 indirect (disp = 0, 1,IR0, 1R1) 


P 


Encoding 

31 


0 0 10 


parallel addressing modes (0 < P < 3) 
OPERATION 

00 src3 X src4, srcl + src2 

01 src3 X srcl, src4 + src2 

10 srcl X src2, src3 + src4 

11 src3 X srcl, src2 + src4 


2423 1615 87 



src4 


0 







MPYI3IIADDI3 


Parallel MPYI3 and ADDI3 


Description 


Cycles 
Status Bits 


Mode Bit 
Example 


An integer multiplication and an integer addition are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that If one of the parallel operations (MPYI3) reads from 
a register and the operation being performed in parallel (ADDIS) writes to 
the same register, then MPYI3 accepts as input the contents of the register 
before it Is modified by the ADDIS. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg¬ 
ister. The assignment of the source operands srcA-srcD to the src1-src4 
fields varies depending on the combination of addressing modes used, and 
the P field Is encoded accordingly. The assembler may, when not signif¬ 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 

1 

N 0 

Z 0 

V 1 if an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 

OVM Operation affected by OVM. 

MPYI3 R7,R4,R0 
II ADDI3 *“AR3,*AR5—(1),R3 

Before instruction: 

R7 = 14h = 20 
R4 = 64h = 100 
RO = Oh 

AR3 = 80981 Fh 
ARB = 80996Eh 
R3 = Oh 

Data at 80981 Eh = OFFFFFFCBh = -53 
Data at 80996Eh = 35h = 53 
LUFLVUFNZVC = 0000000 

After Instruction: 

R7 = 14h = 20 
R4 = 64h = 100 
RO = 07D0h = 2000 
AR3 = 80981 Fh 
ARB = 80996Dh 
R3 = Oh 

Data at 80981 Eh = OFFFFFFCBh = -53 

Data at 80996Eh = 35h = 53 

LUF LVUFNZVC = 0000000 




Parallel MPYI3 and STI3 


MPYI3IISTI 


Syntax MPYI3 <src2>,<src1>,<dst1> 

II STI <src3>,<dst2> 

Operation srcl x src2 -*■ dsti 

11 src3 -► dst2 

Operands srcl register (Rnl, 0 ^ n1 ^1) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn3, 0 ^ n3 ^ 7) 
src3 register (Rn4, 0 ^ n4 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description An integer multiplication and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (MPYI3) writes to the 
same register, then STI accepts as input the contents of the register before 
it Is modified by the MPYI3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-blt output value. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 
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MPYI3IISTI 


Parallel MPYI3 and STI3 


Example 


MPYI3 *++ARO(l),R5,R7 
II STI R2,*"AR3(1) 

Before instruction: 

ARO = 80995Ah 
R5 = 32h = 50 
R7 = Oh 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 80995Bh 

R5 = 32h = 50 

R7 = 271 Oh = 10000 

R2 == ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = ODCh = 220 

LUFLVUFNZVC = 0000000 



Parallel MPYI3 and SUBI3 


MPYI3IISUBI3 


Syntax MPYI3 <srcA>,<srcB>,<dst1> 

II SUBI3 <srcC>,<srcD>,<dst2> 

Operation srcA x srcB -► dst1 

11 srcD - srcC dst2 


Operands 

srcA 

srcB Any two indirect (disp = 0,1,IR0JR1) 
srcC Any two register (0 ^ ARn ^ 7) 
srcD 


dsti register {d1)\ 

0 = RO 
1 = R1 

dst2 register (c/2): 

0 = R2 
1 = R3 


src1 

src2 

src3 

src4 


register (Rn, 0 ^ n < 7) 
register (Rn, 0 < n < 7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


P parallel addressing modes (0 < P < 3) 

OPERATION 

00 src3 X src4, srcl - src2 

01 src3 X srcl, src4 - src2 

10 srcl X src2, src3 - src4 

11 src3 X srcl, src2 - src4 


Encoding 


31 


24 23 


1615 


87 


0 


1 'o 

□gm 

H 

SI 

i 


—I—1— 

src2 

—1—1—1—1—1—1—1— 
src3 

—1—1—1—1—1— \ —1— 
src4 


Description An Integer multiplication and an Integer subtraction are performed in par¬ 
allel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYI3) 
reads from a register and the operation being performed in parallel (SUBI3) 
writes to the same register, then MPYI3 accepts as input the contents of the 
register before it is modified by the SUBI3. 


11 


Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg¬ 
ister. The assignment of the source operands srcA-srcD to the srcl-src4 
fields varies depending on the combination of addressing modes used, and 
the P field Is encoded accordingly. The assembler may, when not signif¬ 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 
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MPYI3IISUBI3 


Parallel MPYI3 and SUBI3 


Cycles 
Status Bits 


Mode Bit 


Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 

N 0 

Z 0 

V 1 if an integer overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if an integer underflow occurs, 0 otherwise. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 

OVM Operation affected by OVM. 


Example 


MPYI3 R2,*4-hAR0(l) ,R0 
II SUBI3 *AR5—(IRl) ,R4,R2 
or 


MPYI3 *+-hARO(l) ,R2,R0 
II SUBI3 *AR5—(IR1),R4,R2 

Before Instruction: 


R2 = 32h = 50 
ARO = 8098E3h 
RO = Oh 

ARB = 8099FCh 

IR1 = OCh 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUFLVUFNZVC^OOOOOOO 
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After Instruction: 

R2 = 320h == 800 

ARO = 8098E4h 

RO = 01324h = 4900 

ARB = 8099F0h 

IR1 = OCh 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUFLVUFNZVC = 0000000 
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Negative Integer with Borrow 


NEGB 

Syntax 

NEGB <src>,<dst> 



Operation 

0 - src - C ^ dst 



Operands 

src general addressing modes (G); 

0 0 register (Rn, 0 < n 27) 

0 1 direct 

1 0 indirect 

1 1 immediate 




dst register (Rn, 0 ^ n < 27) 



Encoding 




31 

2423 1615 

87 

0 


I I I I I I I I I I I I I I I I I I I I 

0 0 0 0 1 0 1 1 0 G dst src 


Description The difference of the 0, src, and C operands is loaded into the dst register. 
The dst and src are assumed to be signed integers. 

Cycles 1 


Status Bits 

N 

1 if a negative result is generated, 0 otherwise. 


Z 

1 if a zero result is generated, 0 otherwise. 


V 

1 if an integer overflow occurs, 0 otherwise. 


c 

1 if a borrow occurs, 0 otherwise. 


UF 

0 


LV 

1 If an Integer overflow occurs, unchanged otherwise. 


LUF 

Unaffected. 

Mode Bit 

OVM Operation affected by OVM. 

Example 

NEGB 

R5,R7 


Before Instruction: 

R5 = OFFFFFFCBh = -53 
R7 = Oh 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After Instruction: 

R5 = OFFFFFFCBh = -53 
R7 = 34h = 52 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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NEGF 


Negate Floating-Point 


Syntax NEGF <src><dst> 

Operation 0 - src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 7) 
01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n < 7) 

Encoding 


31 

24 23 


1615 

87 

0 

fllitll 

QumnoQ 

G 

1 1 1 1 

dst 

1 1 I 1 I 1 I 1 1 1 1 I 1 1 I 

src 


Description The difference of the 0 and src operands is loaded Into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 


Cycles 
Status Bits 


Mode Bit 


1 

N 1 If a negative result Is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 If a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 
LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 


Example 


11 


NEGF *++AR3(2),Rl 

Before Instruction: 

AR3 = 809800h 

R1 = 057B400025h = 6.28125006e+01 
Data at 809802h = 70C8000h = 1.4050e+02 
LUFLVUFNZVC = 0000000 

After instruction: 

AR3 = 809802h 

R1 = 07F3800000h = -1.4050e+02 

Data at 809802h = 70C8000h = 1.4050e+02 

LUF LVUFNZVC = 0001 000 
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Parallel NEGF and STF 


NEGFIISTF 


Syntax NEGF <src2> <clst1> 

II STF <src3><dst2> 

Operation 0 - src2 -► dst1 

11 src3 -► dst2 


Operands 


Encoding 


src2 indirect (disp = 0, 1, IRO, IR1) 
dst7 register (Rn1,0 < n1 ^7) 
src3 register (Rn2, 0 ^ n2 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


31 24 23 1615 8 7 0 


no 

— \ —I—I—I— 
1 0 0 0 1 

—1—1— 
dsti 

IMiH 

— 1 —r— 
src3 

—1—1—1— \ —1—1—1— 
dst2 

—1—1—1—1—1—1—1— 
src2 


Description A floating-point negation and a floating-point store are performed in par¬ 
allel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (NEGF) writes 
to the same register, then STF accepts as Input the contents of the register 
before It is modified by the NEGF. 


If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 1 


Status Bits 


Mode Bit 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 
LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 


11 
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NEGFIISTF 


Parallel NEGF and STF 


Example 


NEGF *AR4—(1),R7 
II STF R2,*++AR5(1) 

Before Instruction: 

AR4 = 8098E1h 
R7 = Oh 

R2 = 0733C00000h = 1.79750e+02 
AR5 = 809803h 

Data at 8098E1h = 57B400000h = 6.281250e+01 
Data at 809804h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8098E0h 

R7 = 0584C00000h = -6.281250e+01 
R2 = 0733C00000h = 1.79750e+02 
ARB = 809804h 

Data at 8098E1h = 57B4000h = 6.281250e+01 
Data at 809804h = 733C000h = 1.79750e+02 
LUF LV UF N Z V C = 0 0 0 1 0 0 0 



Negate Integer 


NEGI 


Syntax NEGI <src>,<dst> 

Operation 0 - src ^ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 

0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n < 27) 

Encoding 

31 24 23 1615 8 7 0 


src 


Description The difference of the 0 and src operands is loaded Into the dst register. The 
dst and src operands are assumed to be signed Integers. 

Cycles 1 

Status Bits N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 If an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 

Example NEGI 174, R5 (174 = OAEh) 

Before Instruction: 

R5 = ODCh = 220 

LUFLVUFNZVC = 0000000 

After Instruction: 

R5 = 0FFFFFF52 = -174 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 









NEGIIISTI 


Parallel NEGI and STI 


Syntax NEGI <src2>,<dst1> 

II STI <src3> ,<dst2> 

Operation 0 - src2 dst7 
11 src3 dst2 


Operands 


src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn1,0 n1 £ 7) 
src3 register (Rn2, 0 ^ n2 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


Encoding 

31 2423 1615 8 7 0 

I—I—I—r—I—I—I—I—I—r—I—I—I—I—I—j—I—I—I—I—I—I—I—I—I—I—I—I—I—!—I—I—I 


na 

QQQQQ 



gg 




Description An integer negation and an integer store are performed in parallel. All reg¬ 
isters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (NEGI) writes to the same 
register, then STI accepts as Input the contents of the register before It is 
modified by the NEGI. 


If src2 and dst2 point to the same location, src2 Is read before the write to 
dst2. 


Cycles 1 


Status Bits 


N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 If an Integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit 


OVM Operation affected by OVM. 


11 
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Parallel NEGI and STI 


NEGIilSTI 


Example 


NEGI *“AR3,R2 
II STI R2,*ARH-+ 

Before Instruction: 

AR3 = 80982Fh 

R2 = 19h = 25 

AR1 = 8098A5h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 80982Fh 

R2 = 0FFFFFF24h = -220 

AR1 = 8098A6h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = 19h - 25 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 


11 
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NOP 


No Operation 


Syntax NOP <src> 

Operation No ALU or multiplier operations. 

ARn is modified if src is specified in indirect mode. 

Operands src general addressing modes (G): 

0 0 register (no operation) 

1 0 indirect (modify ARn, 0 ^ n <7) 

Encoding 


31 24 23 1615 8 7 0 



Description If the src operand is specified in the indirect mode, the specified addressing 
operation is performed and a dummy memory read occurs. If the src oper¬ 
and is omitted, no operation is performed. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example NOP 

Before Instruction: 

PC = 3Ah 

After Instruction: 

PC = 3Bh 

Example NOP *AR3—(1) 

Before Instruction: 

PC = 5h 
AR3 = 809900h 

After Instruction: 

PC = 6h 
AR3 = 8098FFh 
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Normalize 


NORM 


Syntax NORM <src>,<dst> 

Operation norm {src) dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n ^ 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 


Encoding 


31 


24 23 


1615 

87 

0 

IMMlii 

gn 


1 

G 

1 1 1 — 1 — 

dst 

— 1 1 1 1 1 1 1 — 1 1 — 1 1 1 — 1 — 1 1 

src 


Description The src operand is assumed to be an unnormalized floating-point number, 
i.e., the implied bit is set equal to the sign bit. The dst Is set equal to the 
normalized src operand with the implied bit removed. The dst operand ex¬ 
ponent is set to the src operand exponent minus the size of the left-shift 
necessary to normalize the src. The dst operand is assumed to be a nor¬ 
malized floating-point number. 


Cycles 
Status Bits 


Mode Bit 


If src(exp) = -128 and 5rc(man) = 0, then dst = 0, Z = 1, and UF = 0. If 
srcCexp) = -128 and 5r(7(man) ^ 0, then dst = 0,Z = 0, and UF = 1. For 
all other cases of the src, if a floating-point underflow occurs, then 
dst(vf\an) Is forced to 0 and cys?(exp) = -128. If 5A'c(man) = 0, then 
r/5f(man) = 0 and dst{exp) = -128. Refer to Section 5.6. 

1 

N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 1 If a floating-point underflow occurs, 0 otherwise. 

LV Unaffected. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 


Example NORM R1, R2 

Before Instruction: 

R1 = 0400003AF5h 
R2 = 070C800000h 

LUFLVUFNZVC = 0000000 
After Instruction: 

R1 = 0400003AF5h 

R2 = F26BD40000h = 1.12451613e~04 
LUFLVUFNZVC = 0000000 


11 
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NOT 


Bitwise Logical-Complement 


Syntax NOT <src>,<dst> 

Operation ^^src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n ^ 27) 


Encoding 


31 


24 23 


1615 

87 

0 

laiMH 

OB 

iggg 

G 

1 1 — 1 — 1 — 

dst 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 


Description The bitwise logical-complement of the src operand Is loaded into the dst 
register. The complement is formed by a logical-NOT of each bit of the src 
operand. The dst and src operands are assumed to be unsigned integers. 


Cycles 1 


Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


NOT @982Ch,R4 

Before Instruction: 

DP = 80h 
R4 = Oh 

Data at 80982Ch = 5E2Fh 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R4 = 0FFFFA1 DOh 

Data at 80982Ch = 5E2Fh 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 
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Parallel NOT and STI 


NOTIISTI 


Syntax NOT <src2>,<dst1> 

II STI <src3>,<dst2> 

Operation '^src2 dsti 

11 src3 dst2 


Operands src2 indirect (disp = 0, 1, IRO, IR1) 
dst1 register (Rn1,0 < n1 < 7) 
src3 register (Rn2, 0 < n2 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 24 23 1615 8 7 0 


1 1 ' 1 1 

-1-1-T" 

10 0 1 

3 

—1—1— 
dsti 


-1-1— 

src3 

—1— \ —1—1—1—1—1— 
dst2 

—1—1—1—1—1—1—1— 
src2 


Description A bitwise logical-NOT and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (NOT) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the NOT. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 1 


Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit 


OVM Operation not affected by OVM. 


11 
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NOTIISTI 


Parallel NOT and STI 


Example 


NOT *+AR2,R3 
II STI R7,*—AR4(IR1) 

Before Instruction: 

AR2 = 8099CBh 
R3 = Oh 

R7 = ODCh = 220 
AR4 = 809850h 
IR1 =10h 

Data at 8099CCh = 0C2Fh 
Data at 809840h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 8099CBh 
R3 = 0FFFFF3D0h 
R7 = ODCh = 220 
AR4 = 809840h 
IR1 = 10h 

Data at 8099CCh = 0C2Fh 

Data at 809840h = ODCh = 220 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 



Bitwise Logical-OR 


OR 


Syntax 

OR <src><dst> 

Operation 

dst OR 

src -► dst 

Operands 

src general addressing modes (G): 


00 

register (Rn, 0 ^ n ^ 27) 


0 1 

direct 


1 0 

indirect 


1 1 

immediate (not sign-extended) 


dst register (Rn, 0 n ^ 27) 


Encoding 


31 


24 23 


1615 

87 

0 

HUIW 

D 

IQQQQQ 

G 

1 1 1 1 

dst 

1 1 1 — 1 1 1 1 1 1 1 1 1 1 1 1 

src 


Description The bitwise logical-OR between the src and dst operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 


Cycles 1 


Status Bits 


N MSB of the output 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


11 
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OR 


Bitwise Logical-OR 


Example OR *++ARl (IRl) , R2 

Before instruction: 

AR1 = 809800h 
IR1 = 4h 

R2 = 012560000h 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809804h 
IR1 = 4h 

R2 = 012562BCDh 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 


11 
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Bitwise Logical-OR, 3-Operand 


0R3 


Syntax 0R3 <src2><src1><dst> 

Operation srcl OR src2 dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 ^ n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rn1,0 ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n ^ 27) 

Encoding 


31 2423 1615 87 0 



Description The bitwise logical-OR between the srcl and src2 operands is loaded into 
the dst register. The srcl, src2, and dst operands are assumed to be un¬ 
signed integers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


11 
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OR3 


Bitwise Logical-OR, 3-Operand 


Example 


0R3 *++ARl(IRl),R2,R7 

Before Instruction: 

AR1 = 809800h 
IR1 = 4h 

R2 = 012560000h 
R7 = Oh 

Data at 809804h = 2BCDh 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809804h 
IR1 = 4h 

R2 = 012560000h 
R7 = 012562BCDh 
Data at 809804h = 2BCDh 
LUFLVUFNZVC = 0000000 



Parallel 0R3 and STI 


0R3IISTI 


Syntax 0R3 <src2>,<src1>,<dst1> 

II STI <src3> <dst2> 

Operation srcl OR src2 dsti 

11 src3 dst2 

Operands srcl register (Rn1,0 ^ n1 ^7) 

src2 indirect (disp = 0, 1, IRO, IRl) 
dst1 register (Rn2, 0 ^ n2 ^ 7) 
src3 register (Rn3, 0 n3 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IRl) 

Encoding 



Description A bitwise logical-OR and an integer store are performed in parallel. All re¬ 
gisters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (OR3) writes to the same re¬ 
gister, then STI accepts as input the contents of the register before It is 
modified by the OR3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


11 
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0R3IISTI 


Parallel 0R3 and STI 


Example 


0R3 *+4-AR2,R5,R2 

I I STI R6,*AR1— 

Before Instruction: 

AR2 = 809830h 
R5 = 800000h 
R2 = Oh 

R6 = ODCh = 220 

AR1 = 809883h 

Data at 809831 h = 9800h 

Data at 809883h = Oh 

LUFLVUFNZVC = 0000000 

After instruction: 

AR2 = 809831 h 

R5 = 800000h 

R2 = 809800h 

R6 = ODCh = 220 

AR1 = 809882h 

Data at 809831 h = 9800h 

Data at 809883h = ODCh = 220 

LUFLVUFNZVC = 0000000 




POP Integer 


POP 


Syntax POP <dst> 

Operation *SP-- -► dst 

Operands dst register (Rn, 0 < n < 27) 

Encoding 


31 24 23 1615 8 7 0 








Description The top of the current system stack is popped and loaded into the dst reg¬ 
ister. The top of the stack is assumed to be a signed integer. The POP is 
performed with a post decrement of the stack pointer. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 
Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example POP R3 

Before Instruction: 

SP = 809856h 

R3 = 012DAh = 4,826 

Data at 809856h = 0FFFF0DA4h = -62,044 

LUFLVUFNZVC = 0000000 


After Instruction: 

SP = 809855h 

R3 = 0FFFF0DA4h = -62,044 

Data at 809856h = 0FFFF0DA4h = -62,044 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 


11 
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POPF 


POP Floating-Point 


Syntax POPF <dst> 

Operation *SP-- dsti 

Operands dst register (Rn,0<n<7) 

Encoding 



Description The top of the current system stack is popped and loaded into the dst reg¬ 
ister. The top of the stack is assumed to be a floating-point number. The 
POP is performed with a post decrement of the stack pointer. 


Cycles 1 


Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example POPF R4 

Before Instruction: 

SP = 80984Ah 

R4 = 025D2E0123h = 6.91186578e+00 

Data at 80984Ah = 5F2C1302h = 5.32544007e+28 

LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 809849h 

R4 = 5F2C130200h = 5.32544007e+28 

Data at 80984Ah = 5F2C1302h = 5.32544007e+28 

LUFLVUFNZVC = 0000000 
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PUSH Integer 


PUSH 


Syntax PUSH <src> 

Operation src -♦ * + + S P 

Operands src register (Rn,0 < n < 27) 

Encoding 


31 24 23 1615 8 7 0 


—1—1— 
0 0 0 

— \ —1—1—1—1— 
0 11110 

—1— 
0 1 

—1—1—1—1— 

src 

QQQQQQQQQQQQQQQQ 


Description The contents of the src register are pushed on the current system stack . The 
src is assumed to be a signed integer. The PUSH is performed with a pre¬ 
increment of the stack pointer. 


Cycles 1 


Status Bits 


N 

Unaffected. 

Z 

Unaffected. 

V 

Unaffected. 

c 

Unaffected. 

UF 

Unaffected. 

LV 

Unaffected. 

LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example PUSH R6 

Before Instruction: 

SP = 8098AEh 

R6 = 815Bh = 33,115 

Data at 8098AFh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 8098AFh 

R6 = 815Bh = 33,115 

Data at 8098AFh = 815Bh = 33,11 5 

LUFLVUFNZVC = 0000000 


11 
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PUSHF 


PUSH Floating-Point 


11 


Syntax PUSHF <src> 

Operation src -♦ * ++SP 

Operands src register (Rn, 0 ^ n ^ 7) 

Encoding 


31 24 23 1615 8 7 0 



QHnHnn 

m 




Description The contents of the src register are pushed on the current system stack . The 
src is assumed to be a floating-point number. The PUSH is performed with 
a preincrement of the stack pointer. 


Cycles 
Status Bits 


1 

N 

Unaffected. 

Z 

Unaffected. 

V 

Unaffected. 

c 

Unaffected. 

UF 

Unaffected. 

LV 

Unaffected. 

LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example PUSHF R2 

Before Instruction: 

SP = 809801 h 

R2 = 025C128081 h = 6.87725854e+00 
Data at 809802h = Oh 

LUFLVUFNZVC = 0000000 
After Instruction: 

SP = 809802h 

R2 = 025C128081 h = 6.87725854e+00 

Data at 809802h = 025C1280h = 6.87725830e+00 

LUFLVUFNZVC = 0000000 
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Return From Interrupt Conditionally RETIcond 


Syntax 

REWcond 



Operation 

If cond is true: 

*SP- PC 

1 ST(GIE). 

Else, continue. 


Operands 

None 




Encoding 





31 

24 23 


1615 87 0 

QQHQQ 

Wiiliiliil 

go 


111111111111111 

0000000000000000 


Description A conditional return is performed. If the condition is true, the top of the 
stack is popped to the PC, and 1 is written to the global interrupt enable 
(GIE) bit of the status register. This has the effect of enabling ail interrupts 
for which the corresponding interrupt enable bit is a 1. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 


Cycles 

4 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example RETINZ 

Before Instruction: 

PC = 456h 
SP = 809830h 
ST = Oh 

Data at 809830h = 123h 
LUFLVUFNZVC = 0000000 

After Instruction: 

PC =123h 
SP = 80982Fh 
ST = 2000h 

Data at 809830h = 123h 
LUFLVUFNZVC = 0000000 
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RETScond 


Return From Subroutine Conditionally 


Syntax 

RETSconry 

Operation 

If cond is true: 


*SP-- - PC. 


Else, continue. 

Operands 

None 

Encoding 


31 

24 23 



iilMlIlH 

QQ 

— 1 — I — 1 — 1 — 

cond 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

0000000000000000 


Description A conditional return is performed. If the condition is true, the top of the 
stack is popped to the PC, and 1 is written to the global interrupt enable 
(GIE) bit of the status register. This has the effect of enabling all interrupts 
for which the corresponding interrupt enable bit is a 1. 

The TI\/IS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 


Cycles 

4 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


RETSGE 

Before Instruction: 

PC =123h 

SP = 80983Ch 

Data at 80983Ch = 456h 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC = 456h 

SP = 80983Bh 

Data at 80983Ch = 456h 

LUFLVUFNZVC = 0000000 
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Round Floating-Point 


RND 


Syntax RND <src><dst> 


Operation rr)di(src) -*■ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n < 7) 
0 1 direct 
1 0 indirect 
1 1 Immediate 

dst register (Rn, 0 ^ n ^ 7) 


Encoding 


31 

24 23 


1615 

87 

0 

gOg 

— 1 — 1 — 1 — 1 — 1 — 
1 0 0 0 1 0 


1 — 1 1 1 

dst 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 


Description The result of rounding the src operand is loaded Into the dst register. The 
src operand is rounded to the nearest single-precision floating-point value. 
If the src operand Is exactly half-way between two single-precision values, 
it is rounded to the most positive of those values. 


Cycles 
Status Bits 


Mode Bit 


1 

N 1 if a negative result is generated, 0 otherwise, 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 
LUF 1 If a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 


Example RND R5, R2 

Before Instruction: 

R5 = 0733C16EEFh = 1.79755599e+02 
R2 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R5 = 0733C16EEFh = 1.79755599e+02 
R2 = 0733C16F00h = 1.79755600e+02 
LUFLVUFNZVC = 0000000 


11 


11-141 








ROL 


Rotate Left 


Syntax ROL <dst> 

Operation dst I eft-rotated 1 bit dst 

Operands dst register (Rn, 0 < n ^ 27) 
Encoding 


31 


24 23 


1615 87 0 


D 

IQQQQQ 

m 

—1—1—1—1— 

dst 

—1—1—I—1— \ —1—1—1—1—1—1—1—1—1—1— 

0000000000000001 


Description The contents of the dst operand are left-rotated one bit and loaded into the 
dst register. This rotate is a circular rotate with the MSB transferred into the 
LSB. 

Rotate left: 



Cycles 1 


Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the bit rotated out of the high-order bit. Unaf¬ 
fected if dst is not R0-R7. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


ROL R3 

Before Instruction: 

R3 = 80025CD4h 

LUFLVUFNZVC = 0000000 
After Instruction: 

R3 = 0004B9A9h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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Rotate Left Through Carry 


ROLC 


Syntax 

Operation 

Operands 

Encoding 

31 


ROLC <dst> 

dst left-rotated 1 bit through carry bit dst 
dst register (Rn, 0 < n ^ 27) 

2423 1615 


87 


0 



QomiiQQ 

DO 

llllllllllll^^ 



Description The contents of the dst operand are left-rotated one bit through the carry 
bit and loaded into the dst register. The MSB is rotated to the carry bit at 
the same time the carry bit is transferred to the LSB. 

Rotate left through carry bit; 

C ^ dst ^ 


Cycles 
Status Bits 


Mode Bit 
Example 


Example 


1 

N MSB of the output 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the bit rotated out of the high-order bit. If dst is 
not R0-R7, then C is shifted into the dst but not changed. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ROLC R3 

Before Instruction: 

R3 = 00000420h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 


After Instruction: 

R3 = 000000841 h 

LUFLVUFNZVC = 0000000 

ROLC R3 


11 


Before Instruction: 

R3 = 80004281 h 

LUFLVUFNZVC = 0000000 


After Instruction: 

R3 = 00008502h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
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ROR 


Rotate Right 


Syntax ROR <dst> 

Operation dst right-rotated 1 bit through carry bit dst 
Operands dst register (Rn, 0 ^ n ^ 27) 

Encoding 


31 


24 23 


1615 87 0 

mm 

D 

IQQQQQ 

■ 

—I—1—1—1— 

dst 

—1—1—1—I—1—I I—1—i—1—1—1—I i—1— 

1111111111111111 


Description The contents of the dst operand are right-rotated one bit and loaded into 
the dst register. The LSB is rotated into the carry bit and also transferred 
into the MSB. 

Rotate right: 



Cycles 
Status Bits 


Mode Bit 
Example 


11 


1 

N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the bit rotated out of the low-order bit. Unaf¬ 
fected If dst is not R0-R7. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ROR R7 

Before Instruction: 

R7 = 00000421 h 

LUFLVUFNZVC = 0000000 
After Instruction: 

R7 = 8000021 Oh 

LUF LVUFNZVC = 0001 001 
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Rotate Right Through Carry 


RORC 


Syntax RORC <dst> 

Operation dst right-rotated 1 bit through carry bit dst 
Operands dst register (Rn, 0 < n ^ 27) 

Encoding 


31 


24 23 


1615 87 0 


D 

IQQQQQ 

BB 

I 1 1 I 

dst 

I 1 1 1 I 1 I 1 I I I 1 1 I \ 

1111111111111111 


Description The contents of the dst operand are right-rotated one bit through the carry 
bit and loaded into the dst register. The LSB is rotated into the carry bit. 
At the same time, the carry bit is transferred into the MSB. 

Rotate right through carry bit: 


c 


C dst 


3 


Cycies 1 


Status Bits 


N MSB of the output 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Set to the value of the bit rotated out of the low-order bit. If dst is 
not R0-R7, then C is shifted in but not changed. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example RORC R4 

Before Instruction: 

R4 = 00000081 h 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 
After Instruction: 

R4 = 80000040h 

LUF LV UF N Z V C = 0 0 0 1 0 0 1 


11 
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RPTB 


Repeat Block 


Syntax 

RPTB 

<src> 



Operation 

src -► 

RE 




1 ST(RM) 




Next PC RS 



Operands 

src long-immediate addressing mode 



Encoding 





31 

24 23 1615 

87 

0 

QQQQQI 

IQQ 

—1— I — I j 1 —? 1—1—1— I I I 1 1 ? 1 1 I 1 I 1 I I 

src 


Description RPTB allows a block of instructions to be repeated a number of times 
without any penalty for looping. This instruction activates the block repeat 
mode of updating the PC. The src operand Is a 24-bit unsigned immediate 
value that is loaded into the repeat end address (RE) register. A 1 is written 
Into the repeat mode bit of status register ST(RM) to indicate that the PC 
Is being updated in the repeat mode. The address of the next instruction is 
loaded into the repeat start address (RS) register. 


Cycles 4 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit 


OVM Operation not affected by OVM. 


Example 


11 


RPTB 127h 

Before Instruction: 

PC =123h 
ST = Oh 
RE = Oh 
RS = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC =124h 
ST =100h 
RE = 127h 
RS =124h 

LUF LVUFNZVC-0000000 
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Repeat Single 


RPTS 


Syntax RPTS <src> 

Operation src -* RC 

1 ST(RM) 

1 S 

Next PC RS 
Next PC RE 


Operands src general addressing modes (G): 
0 0 register 
01 direct 
1 0 indirect 
1 1 immediate 


Encoding 


31 

24 23 


1615 

87 

0 

ilWI!! 


1 * 1 

G 


1 1 1 1 1 I 1 1 1 1 1 1 1 1 — I 

src 


Description The RPTS instruction allows a single instruction to be repeated a number 
of times without any penalty for looping. Fetches can also be made from the 
instruction register (IR), thus avoiding repeated memory access. 

The src operand is loaded into the repeat counter (RC). A 1 is written Into 
the repeat mode bit of the status register ST(RM). A 1 is also written into 
the repeat single bit (S). This indicates that the program fetches are to be 
performed only from the Instruction register, the repeat single mode. The 
next PC is loaded into the repeat end address (RE) register and the repeat 
start address (RS) register. 

The src operand is assumed to be an unsigned integer and is not sign-ex¬ 
tended for immediate mode. 


Cycles 4 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected, 

Mode Bit 

OVM 

Operation not affected by OVM 
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RPTS 


Repeat Single 


Example RPTS AR5 

Before instruction: 

PC =123h 
ST = Oh 
RS = Oh 
RE = Oh 
RC = Oh 
AR5 = OFFh 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC =124h 
ST =lOOh 
RS =124h 
RE = 124h 
RC = OFFh 
AR5 = OFFh 

LUFLVUFNZVC = 0000000 


11 


11-148 




Signal, Interlocked 


SIGI 


Syntax 


SIGI 



Operation 

Signal interlocked operation. 

Walt for Interlock acknowledge. 

Clear Interlock. 

Operands 

None 



Encoding 




31 


24 23 


1615 87 0 



BOg 

gg 

—I" I—I ”i—I— r—~[ —I—r~i—r“ ' I' ' i" '■ "i—i"— t— i r i i— 

000000000000000000000 


Description An interlocked operation is signaled over XFO and XF1. After the interlocked 
operation is acknowledged, the interlocked operation ends. SIGI ignores 
the external ready signals. Refer to Section 7.3 for detailed information. 


Cycles 1 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit 


OVM Operation not affected by OVM. 


11 
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STF 


Store Floating-Point 


Syntax STF <src>,<dst> 

Operation src dst 

Operands src register (Rn, 0 ^ n 7) 

dst general addressing modes (G): 

01 direct 
1 0 indirect 

Encoding 



Description The src register is loaded into the dst memory location. The src and dst 
operands are assumed to be floating-point numbers. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example STF R2 , @98Alh 

Before Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e+01 
Data at 8098A1 h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e+01 

Data at 8098A1 h = 52C5019h = 4.30782204e+01 

LUFLVUFNZVC = 0000000 
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Store Floating-Point, Interlocked 


STFI 


Syntax STFI <src>,<dst> 

Operation src dst 

Signal end of interlocked operation. 

Operands src register (Rn,0<n<7) 

dst general addressing modes (G): 
0 1 direct 
1 0 indirect 


Encoding 


31 24 23 1615 8 7 0 


□QQ 

—I—I—I—I—I— 

10 10 0 1 

G 

—1—1—1—1— 

src 

—1—1—1—1—1—1—1—1—1—1—1—1—1—1—1— 
dst 


Description The src register is loaded into the dst memory location. An interlocked op¬ 
eration is signaled over pins XFO and XF1. The src and dst operands are 
assumed to be floating-point numbers. Refer to Section 7.3 for detailed 
information. 


Cycles 

1 


Status Bits 

N 

Unaffected. 


Z 

Unaffected. 


V 

Unaffected. 


c 

Unaffected. 


UF 

Unaffected. 


LV 

Unaffected. 


LUF 

Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example STFI R3,*-AR4 

Before Instruction: 

R3 = 0733C00000h = 1.79750e+02 
AR4 = 80993Ch 
Data at 80993Bh = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

R3 = 0733C00000h = 1.79750e+02 
AR4 = 80993Ch 

Data at 80993Bh = 733C000h = 1.79750e+02 
LUFLVUFNZVC = 0000000 


11 
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STFIISTF 


Parallel STF and STF 


Syntax STF <src2> <dst2> 

II STF <src1>,<dst'\> 

Operation src2 dstl 

II srcl dst^ 

Operands srcl register (Rn1,0 n1 ^7) 

dstl indirect (disp = 0, 1, IRO, IR1) 
src2 register (Rn2, 0 £ n2 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description Two floating-point stores are performed in parallel. If both stores are exe¬ 
cuted to the same address, the value written is that of STF 
<src2><dst2>. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


11 
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Parallel STF and STF 


STFIISTF 


Example 


STF R4,*AR3— 

I I STF R3,*++AR5 

Before instruction: 

R4 = 070C800000h = 1.4050e+02 
AR3 = 809835h 

R3 = 0733C00000h = 1.79750e+02 
AR5 = 8099D2h 
Data at 809835h = Oh 
Data at 8099D3h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = 070C800000h = 1.4050e+02 
AR3 = 809834h 

R3 = 0733C00000h = 1.79750e+02 
ARB = 8099D3h 

Data at 809835h = 070C8000h = 1.4050e+02 
Data at 8099D3h = 0733C000h = 1.79750e+02 
LUFLVUFNZVC = 0000000 


11 
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STI 


Store Integer 


Syntax STI <src>,<dst> 

Operation src dst 

Operands src register (Rn, 0 ^ n 27) 

dst general addressing modes (G): 
0 1 direct 
1 0 indirect 

Encoding 



Description The src register is loaded into the dst memory location. The src and dst 
operands are assumed to be signed integers. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example STI R4,@982Bh 

Before Instruction: 

DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = OEBFCh = 58,876 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = 42BD7h = 273,367 

LUFLVUFNZVC = 0000000 
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Store Integer, Interlocked 


STM 


Syntax STM <src>,<dst> 

Operation src dst 

Signal end of interlocked operation. 

Operands src register (Rn, 0 ^ n 27) 

dst general addressing modes (G): 

01 direct 
1 0 indirect 

Encoding 



Description The src register is loaded Into the dst memory location. An interlocked op¬ 
eration Is signaled over pins XFO and XF1. The src and dst operands are 
assumed to be signed Integers. Refer to Section 7.3 for detailed Information. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example STII Rl,@98AEh 

Before Instruction: 

DP = 80h 
R1 = 78Dh 

Data at 8098AEh = 25Ch 

After Instruction: 

DP = 80h 
R1 = 78Dh 

Data at 8098AEh = 7BDh 
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STIIISTI 


Parallel STI and STI 


Syntax STI <src2>,<dst2> 

II STI <src1 >,<dst'[ > 

Operation src2 dstl 

II src1 dst^ 

Operands src1 register (Rnl, 0 n1 ^7) 

dst^ indirect (disp = 0, IRO, IR1) 
src2 register (Rn2, 0 ^ n2 ^ 7) 
dstl indirect (disp = 0, 1, IRO, IR1) 

Encoding 



Description Two integer stores are performed in parallel. If both stores are executed to 
the same address, the value written is that of STI <src2> <dst2>. 

Cycles 1 

Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


11 
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Parallel STI and STI 


STIIISTI 


Example 


STI R0,*++AR2(IR0) 

II STI R5,*AR0 

Before instruction: 

RO = ODCh = 220 

AR2 = 809830h 

IRO = 8h 

R5 = 35h = 53 

ARO = 8098D3h 

Data at 809838h = Oh 

Data at 8098D3h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 


RO = ODCh = 220 

AR2 = 809838h 

IRO = 8h 

R5 = 35h = 53 

ARO = 8098D3h 

Data at 809838h = ODCh = 220 

Data at 8098D3h = 35h = 53 

LUFLVUFNZVC = 0000000 




SUBB 


Subtract Integer with Borrow 


Syntax SUBB <src>,<dst> 

Operation dst - src - C ^ dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 ^ n < 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n < 27) 


Encoding 


31 


24 23 


1615 

87 

0 


DB 

miQ 

1 1 1 

G 

1 , 1 , 

dst 

I 1 I 1 1 1 1 1 I 1 1 1 1 1 1 

src 


Description The difference of the dst, src, and C operands is loaded Into the dst register. 
The dst and src operands are assumed to be signed integers. 


Cycles 1 


Status Bits 


N 1 if a negative result is generated, 0 otherwise. 

Z 1 If a zero result Is generated, 0 otherwise. 

V 1 If an Integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit OVM Operation affected by OVM. 


Example 


11 


SUBB *AR5++(4),R5 

Before instruction: 

AR5 = 809800h 

R5 = OFAh = 250 

Data at 809800h = 0C7h = 199 

LUF LVUFNZVC = 0000001 

After Instruction: 

ARB = 809804h 

R5 = 032h = 50 

Data at 809800h = 0C7h = 199 

LUF LVUFNZVC = 0000000 
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Subtract Integer with Borrow, 3-Operand 


SUBB3 


Syntax SUBB3 <src2>,<src1> <dst> 

Operation srcl - src2 - C dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1, 0 ^ n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rn1,0 ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 ^ 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 < n < 27) 

Encoding 


31 24 23 1615 8 7 0 



Description The difference of the srcl and src2 operands and the C (carry) flag is 

loaded Into the dst register. The srcl, src2, and dst operands are assumed 

to be signed Integers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 If a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 


11 


11-159 





SUBB3 


Subtract Integer with Borrow, 3-Operand 


Example 


SUBB3 R5,*AR5++(IRO),RO 

Before instruction: 

ARB = 809800h 
IRO = 4h 

R5 = 0C7h = 199 
RO = Oh 

Data at 809800h = OFAh = 250 

LUF LVUFNZVC = 0000001 

After Instruction: 

ARB = 809804h 
IRO = 4h 

R5 = 0C7h = 199 
RO = 32h = 50 

Data at 809800h = OFAh = 250 
LUFLVUFNZVC = 0000000 



Subtract Integer Conditionally 


SUBC 


Syntax 

Operation 

Operands 


Encoding 

31 



SUBC <src>,<dst> 

If (dst - src ^ 0): 

{dst - src << 1) OR 1 -► dst 
Else; 

dst << ^ dst 

src general addressing modes (G): 
0 0 register (Rn, 0 ^ n 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 27) 


24 23 


1615 


87 


0 



Description A subtraction of the src operand from the dst operand is performed. The 
dst operand is loaded with a value dependent upon the result of the sub¬ 
traction. If {dst - src) is greater than or equal to zero, then {dst - src) is 
left-shifted one bit, the least-significant bit is set to 1, and the result is 
loaded into the dst register. If {dst - src) Is less than zero, dst Is left-shifted 
one bit and loaded into the dst register. The dst and src operands are as¬ 
sumed to be unsigned integers. 

SUBC may be used to perform a single step of a multi-bit integer division. 
See Section 12.3.3 for a detailed description. 

Cycles 1 


Status Bits N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 
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SUBC 


Subtract Integer Conditionally 


Example SUBC @98C5h,Rl 

Before Instruction: 

DP = 80h 

R1 = 04F6h =1270 

Data at 8098C5h = 492h = 1170 

LUFLVUFNZVC = 0000000 

After instruction: 

DP = 80h 

R1 = 0C9h = 201 

Data at 8098C5h = 492h = 1170 

LUFLVUFNZVC = 0000000 

Example SUBC 3000,RO (3000 = 0BB8h) 

Before instruction: 

RO = 07D0h = 2000 

LUFLVUFNZVC = 0000000 
After instruction: 

RO = OFAOh = 4000 

LUFLVUFNZVC = 0000000 
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Subtract Floating-Point 


SUBF 


Syntax SUBF <src>,<c/st> 

Operation dst - src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n ^ 7) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 7) 

Encoding 



Description The dst operand minus the src operand is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an floating-point overflow occurs, 0 otherwise. 

C Unaffected 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if an floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 

Example SUBF *ARO-—(IRQ) ,R5 

Before Instruction: 

ARO = 809888h 
IRQ = 80h 

R5 = 0733C00000h = 1.79750000e+02 
Data at 809888h = 70C8000h = 1.4050e+02 
LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 809808h 
IRO = 80h 

R5 = 051 DOOOOOOh = 3.9250e+01 

Data at 809888h = 70C8000h = 1.4050e+02 

LUFLVUFNZVC = 0000000 
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SUBF3 


Subtract Floating-Point, 3-Operand 


Syntax SUBF3 <src2> <src1>,<dst> 

Operation srcl - src2 -♦ dst 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1, ^ ^7) 

0 1 indirect (disp = 0,1, IRO, IR1) 

1 0 register (Rn1, ^ nl < 7) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 

0 0 register (Rn2, ^ n2 ^ 7) 

0 1 register (Rn2, < n2 < 7) 

1 0 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 n ^ 7) 

Encoding 

31 24 23 1615 8 7 0 



Description The difference of the srcl and src2 operands is loaded into the dst register. 

The src1, src2, and dst operands are assumed to be floating-point numbers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 If a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 
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Subtract Floating-Point, 3-Operand 


SUBF3 


Example 


Example 


SUBF3 *ARO—(IRO),*AR1,R4 

Before instruction: 

ARO = 809888h 
IRO = 80h 
AR1 = 809851 h 
R4 = Oh 

Data at 809888h = 70C8000h = 1.4050e+02 
Data at 809851 h = 733C000h = 1.79750e+02 
LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 809808h 
IRO = 80h 
AR1 = 809851 h 

R4 = 51 DOOOOOOh = 3.9250e+01 
Data at 809888h = 70C8000h = 1.4050e+02 
Data at 809851 h = 733C000h = 1.79750e+02 
LUFLVUFNZVC = 0000000 

SUBF3 R7,R0,R6 

Before Instruction: 

R7 = 57B400000h = 6.281250e+01 
RO = 34C200000h = 1.27578125e+01 
R6 = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 

R7 = 57B400000h = 6.281250e+01 
RO = 34C200000h = 1.27578125e+01 
R6 = 5B7C80000h = >5.00546875e-f01 
LUF LV UF N Z V C = 0 0 0 1 0 0 0 



SUBF3IISTF 


Parallel SUBF3 and STF 


Syntax SUBF3 <src1>,<src2>,<dst^> 

II STF <src3> <dst2> 

Operation src2 - srcl dst^ 

11 src3 -► dst2 

Operands srcl register (Rn1,0 < n1 ^7) 

src2 indirect (disp = 0, 1, IRQ, IR1) 
dsti register (Rn2, 0 ^ n2 ^ 7) 
src3 register (Rn3, 0 ^ n3 7) 
dst2 indirect (disp = 0,1, IRQ, IR1) 

Encoding 


31 


24 23 


1615 


8 



Description A floating-point subtraction and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that If one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (SUBF3) 
writes to the same register, then STF accepts as input the contents of the 
register before it is modified by the SUBF3. 

If src2 and dst2 point to the same location, src2 Is read before the write to 
dst2. 

Cycles 1 

Status Bits IM 1 if a negative result is generated, 0 otherwise. 

Z 1 If a zero result Is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 If a floating-point underflow occurs, 0 otherwise. 

LV 1 If a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 
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Parallel SUBF3 and STF 


SUBF3IISTF 


Example 


SUBF3 R1,*-AR4(IR1),R0 
I I STF R7,*+AR5(IR0) 

Before Instruction: 

R1 = 057B400000h = 6.28125e+01 
AR4 = 8098B8h 
IR1 = 8h 
RO = Oh 

R7 = 0733C00000h = 1.79750e+02 
ARB = 809850h 
IRO = lOh 

Data at 8098B0h = 70C8000h = 1.4050e+02 
Data at 809860h = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 

R1 = 057B400000h = 6.28125e+01 
AR4 = 8098B8h 
IR1 = 8h 

RO = 061 B600000h = 7.768750e+01 
R7 = 0733C00000h = 1.79750e+02 
ARB = 8098B0h 
IRO = lOh 

Data at 8098B0h = 70C8000h = 1.40B0e+02 
Data at 809860h = 733C000h = 1.797B0e+02 
LUFLVUFNZVC = 0000000 


11 
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SUBI 


Subtract Integer 


Syntax SUBI <src>,<c/st> 

Operation dst - src dst 


Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 < n ^ 27) 


Encoding 


31 

24 23 


1615 

87 

0 

WWW 

QQQQQQ 

B 

1 1 1 1 

dst 

1 1 I 1 1 1 1 1 1 1 1 I 1 1 I 

src 


Description The dst operand minus the src operand is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 


Cycles 1 


Status Bits 


N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 if an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit OVM Operation affected by OVM. 


Example 


11 


SUBI 220,R7 

Before Instruction: 

R7 = 226h = 550 

LUFLVUFNZVC = 0000000 
After Instruction: 

R7 = 14Ah = 330 

LUFLVUFNZVC = 0000000 
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Subtract integer, 3-Operand 


SUBI3 


Syntax SUBI3 <src2>,<src1>,<dst> 

Operation srcl - src2 dst 

Operands src1 three-operand addressing modes (T): 

0 0 register (Rnl, 0 ^ n1 < 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 

1 0 register (Rnl, 0 n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 

0 0 register (Rn2, 0 ^ n2 ^ 27) 

0 1 register (Rn2, 0 ^ n2 < 27) 

1 0 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 

31 24 23 1615 8 7 0 



Description The src1 operand minus the src2 operand is loaded into the dst register. The 
src1, src2, and dst operands are assumed to be signed integers. 

Cycles 1 

Status Bits N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if an Integer overflow occurs, 0 otherwise. 

C 1 If a borrow occurs, 0 otherwise. 

UF 0 

LV 1 if an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 

Example SUBI3 R7,R2,R0 

Before instruction: 

R2 = 0866h = 2150 
R7 = 0834h = 2100 
RO = Oh 

LUFLVUFNZVC = 0000000 

After instruction: 

R2 = 0866h = 2150 
R7 - 0834h = 2100 
RO = 032h = 50 

LUF LV UF N Z V C = 0 0 0 1 0 0 0 
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SUBiS 


Subtract Integer, 3-Operand 


Example 


SUBI3 *-AR2(l),R4,R3 

Before Instruction: 

AR2 = 80985Eh 
R4 = 0226h = 550 
R3 = Oh 

Data at 80985Dh = ODCh = 220 

LUF LVUFNZVC = 0000000 

After instruction: 

AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 014Ah = 330 

Data at 80985Dh = ODCh = 220 

LUF LVUFNZVC = 0000000 



Parallel SUBI3 and STI 


SUBI3IISTI 


Syntax SUBI3 <src7>,<src2>,<c/st^> 

II STI <src3>,<dst2> 


Operation src2 - srcl dst^ 

11 src3 dstl 


Operands srcl register (Rn1, 0 ^ n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dstl register (Rn2, 0 ^ n2 7) 
src3 register (Rn3, 0 ^ n3 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 

Encoding 


31 24 23 1615 8 7 0 


1' 1 


mmm 

— 1 —1— 
srcl 

mmm 

—1—1— \ —1—1—1—1— 

dst2 

—I—1—1—1—1—1—1— 

src2 


Description An integer subtraction and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (SUBI3) writes to the 
same register, then STI accepts as Input the contents of the register before 
it Is modified by the SUBI3. 


If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 


Cycles 1 


Status Bits 


N 1 If a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 If an Integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 
LUF Unaffected. 


Mode Bit 


OVM Operation affected by OVM. 


11 
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SUBt3||STI 


Parallel SUBI3 and STI 


Example 


SUBI3 R7,*+AR2(IR0),R1 
I I STI R3,*++AR7 

Before Instruction: 


R7 = 14h = 20 

AR2 = 80982Fh 

IRO =10h 

R1 = Oh 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 


R7 = 14h = 20 

AR2 = 80982Fh 

IRO = 10h 

R1 = 0C8h = 200 

R3 = 35h = 53 

AR7 = 80983Ch 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = 35h = 53 

LUFLVUFNZVC = 0000000 



Subtract Reverse Integer with Borrow 


SUBRB 


Syntax SUBRB <src>,<dst> 

Operation src - dst - C dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 27) 

Encoding 



Description The difference of the src, dst, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

Cycles 1 

Status Bits N 1 if a negative result Is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 If an integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 If an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 

Example SUBRB R4, R6 

Before instruction: 

R4 = OSCBh = 971 
R6 = 0258h = 600 

LUF LV UF N Z V C = 0 0 0 0 0 0 1 

After instruction: 

R4 = OSCBh = 971 
R6 = 0172h = 370 

LUF LVUFNZVC = 0000000 
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SUBRF 


Subtract Reverse Floating-Point 


Syntax SUBRF <src>,<dst> 

Operation src - dst dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 n ^ 7) 

01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 7) 

Encoding 



Description The src operand minus the dst operand is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result Is generated, 0 otherwise. 

V 1 if a floating-point overflow occurs, 0 otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, 0 otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

Mode Bit OVM Operation not affected by OVM. 

Example SUBRF @9905h,R5 

Before Instruction: 

DP = 80h 

R5 = 057B400000h = 6.281250e-f01 

Data at 809905h = 733C000h = 1.79750e+02 

LUF LVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = 0669E00000h = 1.16937500e+02 
Data at 809905h = 733C000h = 1.79750e-f02 
LUFLVUFNZVC = 0000000 
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Subtract Reverse Integer 


SUBRI 


Syntax SUBRI <src>.<dst> 

Operation src - dst dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n 27) 

Encoding 



Description The src operand minus the dst operand is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 

Cycles 1 

Status Bits N 1 if a negative result is generated, 0 otherwise. 

Z 1 if a zero result is generated, 0 otherwise. 

V 1 If an Integer overflow occurs, 0 otherwise. 

C 1 if a borrow occurs, 0 otherwise. 

UF 0 

LV 1 If an Integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

Mode Bit OVM Operation affected by OVM. 

Example SUBRI * AR5+-f (IRQ) , R3 

Before Instruction: 

AR5 = 809900h 
IRQ = 8h 

R3 = ODCh = 220 

Data at 809900h = 226h = 550 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 809908h 
IRO = 8h 

R3 = 014Ah = 330 

Data at 809900h = 226h = 550 

LUFLVUFNZVC = 0000000 
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SWI 


Software Interrupt 


Syntax 

SWI 

Operation 

Performs an emulation interrupt 

Operands 

None 

Encoding 



31 


24 23 1615 8 7 0 



—1—1 ■ 1 I 1 ""I —1 n—1—1 i 1 — 1 — 1 — 1—1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — T” 

0000000000000000000000000 


Description The SWI instruction performs an emulator interrupt. This is a reserved in¬ 
struction and should not be used in normal programming. 


Cycles 4 


Status Bits 


N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


11 
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Trap Conditionally 


TRAPconcf 


Syntax JRAPcond N 

Operation 0 ST(GIE) 

If cond is true: 

Next PC * + + SP 

Trap vector N -► PC 
Else: 

Set ST(GIE) to original state 
Continue. 


Operands N (0 < N ^ 31) 
Encoding 


31 24 23 1615 8 7 0 




IIIIIIQQQIIIIIII^ 

QQQQQQQQQQH 

BQHl 


Description Interrupts are disabled globally when 0 is written to ST(GIE). If the con¬ 
dition Is true, the contents of the PC are pushed on the system stack and 
the PC Is loaded with the contents of the specified trap vector (N). If the 
condition Is not true, ST(GIE) is set to its value before the IPAPcond in¬ 
struction changed It. 


Cycles 
Status Bits 


The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 

5 

N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

UF Unaffected. 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example TRAP Z 16 

Before instruction: 

PC =123h 
SP = 809870h 
ST = Oh 

Trap Vector 16 = lOh 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC = lOh 

SP = 809871 h 

Data at 809871 h = 124h 

ST = Oh 

LUFLVUFNZVC = 0000000 


11 
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TSTB 


Test Bit Fields 


Syntax TSTB <src>,<dst> 

Operation dst AN D src 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 £ n ^ 27) 
01 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 £ n 27) 


Encoding 


31 

24 23 


1615 

87 

0 

QQQ 

QQQQQQ 

Q 

1 1 1 1 

dst 

1 1 1 1 1—1 1 1 1 1 1 1 1 1 1 

src 


Description 


Cycles 
Status Bits 


The bitwise logical-AND of the dst and src operands is formed, but the re¬ 
sult Is not loaded In any register. This allows for nondestructive compares. 
The dst and src operands are assumed to be unsigned Integers. 

1 

N MSB of the output. 

Z 1 if a zero output Is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


Example 


11 


TSTB *-AR4(l),R5 

Before Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1895 

LUF LVUFNZVC = 0000000 

After Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1895 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 
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Test Bit Fields, 3-Operand TSTB3 


Syntax TSTB3 <src2>,<src1> 

Operation srcl AND src2 

Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1,0 ^ n1 ^ 27) 

0 1 indirect (disp = 0, 1JRO, IR1) 
1 0 register (Rnl, 0 ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRQ, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 ^ n2 ^ 27) 

0 1 register (Rn2, 0 n2 ^ 27) 

1 0 indirect (disp = 0, UR0JR1) 
1 1 indirect (disp = 0, 1, IRQ, IR1) 

Encoding 


31 24 23 1615 8 7 0 



Description The bitwise logical-AND between the src1 and src2 operands is formed, 
but is not loaded into any register. This allows for nondestructive compares. 
The src1 and src2 operands are assumed to be unsigned Integers. Although 
this instruction has only two operands, it is designated as a three operand 
instruction since operands are specified In the three operand format. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output Is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 


11 
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TSTB3 


Test Bit Fields, 3-Operand 


\ 


Example TSTB3 *AR5— (IRQ) , *+ARO (1) 

Before instruction: 

AR5 = 809885h 

IRQ = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1895 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 809805h 

IRO = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1895 

LUF LV UF N Z V C = 0 0 0 0 1 0 0 

Example TSTB3 R4,*AR6—(IRO) 

Before Instruction: 

R4 = 0FBC4h 
AR6 = 8099F8h 
IRO = 8h 

Data at 8099F8h = 1568h 
LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = 0FBC4h 
AR6 = 8099F0h 
IRO = 8h 

Data at 8099F8h = 1568h 

LUF LVUFNZVC = 0000000 


11 
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Bitwise Exclusive-OR 


XOR 


Syntax XOR <src>,<dst> 

Operation dst XOR src dst 

Operands src general addressing modes (G): 

0 0 register (Rn, 0 < n ^ 27) 
0 1 direct 
1 0 indirect 
1 1 immediate 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 



Description The bitwise exclusive-OR of the src and dst operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 

Example XOR R1,R2 

Before Instruction: 

R1 = 0FFA32h 
R2 = OFFBCIh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0FF3A2h 
R2 = 000FF3h 

LUFLVUFNZVC = 0000000 
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X0R3 


Bitwise Exciusive-OR, 3-Operand 


Syntax X0R3 <src2>,<src1>,<dst> 

Operation srcl XOR src2 dst 


Operands srcl three-operand addressing modes (T): 

0 0 register (Rn1, 0 ^ n1 ^ 27) 

0 1 indirect (disp = 0, 1, IRO, IR1) 
1 0 register (Rn1,0 ^ n1 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
0 0 register (Rn2, 0 < n2 ^ 27) 

0 1 register (Rn2, 0 n2 27) 

1 0 indirect (disp = 0,1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, 0 ^ n ^ 27) 

Encoding 


31 24 23 1615 8 7 0 


BOO 

QQQQQQ 

a 





Description 


Cycles 
Status Bits 


The bitwise exclusive-OR between the srcl and src2 operands is loaded 
into the dst register. The srcl, src2, and dst operands are assumed to be 
unsigned integers. 

1 

IM MSB of the output. 

Z 1 if a zero output is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 


Mode Bit OVM Operation not affected by OVM. 


11 
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Bitwise Exclusive-OR, 3-Operand 


XOR3 


Example 


Example 


X0R3 *AR3++(IR0),R7,R4 

Before Instruction: 

AR3 = 809800h 
IRQ =10h 
R7 = OFFFFh 
R4 = Oh 

Data at 809800h = 5AC3h 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 80980Fh 

IRO = 10h 

R7 = OFFFFh 

R4 = 0A53Ch 

Data at 809800h = 5AC3h 

LUFLVUFNZVC = 0000000 

XOR3 R5,*-AR1(1),R1 

Before Instruction: 

R5 = 0FFA32h 
AR1 = 809826h 
R1 = Oh 

Data at 809825h = 0FF5C1h 
LUFLVUFNZVC = 0000000 

After Instruction: 

R5 = 0FFA32h 

AR1 = 809826h 

R1 = 000F33h 

Data at 809825h = 0FF5C1 h 

LUFLVUFNZVC = 0000000 


11 
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X0R3I1STI 


Parallel X0R3 and STI 


Syntax X0R3 <src2>,<src1>,<dst1> 

II STI <src3>,<dst2> 

Operation srcl XOR src2 dsti 

11 src3 dst2 

Operands srcl register (Rn1,0 < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn2, s nl <1) 
src3 register (Rn3, n3 < 7) 
dst2 indirectO(disp = 0, 1, IRO, IR1) 

Encoding 



Description A bitwise exclusive-XOR and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (XOR3) writes to the 
same register, then STI accepts as Input the contents of the register before 
it is modified by the XOR3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Cycles 1 

Status Bits N MSB of the output. 

Z 1 if a zero output Is generated, 0 otherwise. 

V 0 

C Unaffected. 

UF 0 

LV Unaffected. 

LUF Unaffected. 

Mode Bit OVM Operation not affected by OVM. 
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Parallel XOR3 and STI 


X0R3IISTI 


Example 


X0R3 *AR1++,R3,R3 
II STI R6,*-AR2(IR0) 

Before instruction: 

AR1 = 80987Eh 
R3 = 85h 
R6 = ODCh = 220 
AR2 = 8098B4h 
IRO = 8h 

Data at 80987Eh = 85h 
Data at 8098ACh = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

ARl = 80987Fh 
R3 - Oh 

R6 = ODCh = 220 
AR2 = 8098B4h 
IRO = 8h 

Data at 80987Eh = 85h 

Data at 8098ACh = ODCh = 220 

LUFLVUFNZVC = 0000000 
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Section 12 


Software Applications 


The TMS320C30 is a very powerful digital signal processor with architecture 
and instruction set designed to make easy system solutions to DSP problems. 
There are Instructions specifically designed for efficient implementations of 
DSP algorithms, but also there are general-purpose Instructions that make the 
device suitable for more general tasks, like any microprocessor. The floating 
point and integer arithmetic supported by the device permits the designer to 
concentrate on the algorithm with minimal concerns about scaling, dynamic 
range, and overflows. 

The purpose of this section is to explain how to use the Instruction set, the 
architecture, and the interface of the TMS320C30 processor. This is done by 
presenting coding examples for very frequently used applications, and by dis¬ 
cussing more Involved examples and applications. In all cases, besides ex¬ 
plaining the principles involved in the application, the corresponding 
assembly-language code is given for instructional purposes and for immediate 
use. Whenever the detailed explanation of the underlying theory is too ex¬ 
tensive to be included in this manual, appropriate references are given for 
further information. 

Major topics discussed In this section are listed below. 

• Processor Initialization (Section 12.1 on page 12-3) 

• Program Control (Section 12.2 on page 12-7) 

- Subroutine calls 

- Software stack 

“ Interrupt handling 

- Delayed branches 

- Repeat modes 

- Computed GOTO's 

• Logical and Arithmetic Operations (Section 12.3 on page 12-20) 

- Bit manipulation 

- Block moves 

- Bit-reversed addressing 

- Division 

- Square root 

- Extended-precision arithmetic 

- IEEE < = = > C30 floating point conversions 

• Application-oriented Operations (Section 12.4 on page 12-45) 

- Companding (A-law, M-law) 

FIR / MR filters (fixed and adaptive) 

- Matrix math 
FFT 
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Lattice filters 

• Programming Tips (Section 12.5 on page 12-87) 
“ C-callable Routines 
- Code Optimization Check-list 
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12.1 Processor Initialization 

Prior to the execution of a digital signal processing algorithm, it is necessary 
to initialize the processor. Generally, initialization takes place any time the 
processor is reset. 

When reset is activated by applying a low level to the RESET input for several 
cycles, the TMS320C30 terminates execution and puts the reset vector (i.e., 
the contents of memory location 0) in the program counter. The reset vector 
normally contains the address of the system initialization routine. The hard¬ 
ware reset also Initializes various registers and status bits. 

After reset, the processor should be initialized to meet the requirements of the 
system. Instructions should be executed that set up operational modes, me¬ 
mory pointers, interrupts, and the remaining functions needed to meet system 
requirements. 

To configure the processor at reset, the following internal functions should 
be initialized: 

• Memory-mapped registers 

• Interrupt structure 


Example 12-1 shows coding for initializing the TMS320C30 to the following 
machine state, in addition to the Initialization performed during the hardware 
reset (for conditions after hardware reset, see section 13): 

• All Interrupts are enabled. 

• The overflow mode Is disabled. 

• The data memory page pointer is set to zero. 

• The Internal memory is filled with zeros. 

Note that all constants larger than 16 bits should be placed in memory and 
accessed through direct or indirect addressing. 




Software Applications - Processor Initialization 


Example 12-1. TMS320C30 Processor Initialization 


* 

* TITL 'PROCESSOR INITIALIZATION EXAMPLE' 

* 

.global RESET,INIT,BEGIN 
.global INTO,INTI,INT2,INT3 
.global ISR0,ISR1,ISR2,ISR3 
.global DINT,DMA 

.global TINTO,TINTl,XINTO,RINTO,XINTl,RINT1 
.global TIME0,TIME1,XMTO,RCVO,XMTl,RCVl 
.global TRAPO,TRAP1,TRAP2,TRPO,TRP1,TRP2 

* 

* PROCESSOR INITIALIZATION FOR THE TMS320C30. 

* 

* RESET AND INTERRUPT VECTOR SPECIFICATION. THIS 

* ARRANGEMENT ASSUMES THAT DURING LINKING, THE FOLLOWING 

* TEXT SEGMENT WILL BE PLACED TO START AT MEMORY 

* LOCATION 0. 

* 



. sect 

"init" ; 

f Named section 

RESET 

•k 

.word 

INIT ] 

; RS- loads address INIT to PC 

INTO 

.word 

ISRO 

INTO- loads 

address INTO to PC 

INTI 

.word 

ISRl 

INTI- loads 

address INTI to PC 

INT2 

.word 

ISR2 

INT2- loads 

address INT2 to PC 

INT3 

•k 

.word 

ISR3 

INT3- loads 

address INT3 to PC 

XINTO 

.word 

XMTO 

Serial port 

0 transmit processing 

RINTO 

.word 

RCVO 

Serial port 

0 receive processing 

XINTO 

.word 

XMTl 

Serial port 

1 transmit processing 

RINTO 

.word 

RCVl 

Serial port 

1 receive processing 

TINTO 

.word 

TIMEO 

Timer 0 interrupt processing 

TINTl 

.word 

TIMEl 

Timer 1 interrupt processing 

DINT 

.word 

DMA 

DMA interrupt 


. space 

20 

Reserved space 

TRAPO 

. word 

TRPO 

Trap 0 vector processing begins 

TRAPl 

.word 

TRPl 

Trap 1 vector processing begins 

TRAP 2 

.word 

TRP 

Trap 2 vector processing begins 


. space 

29 

} Leave space 

for the other 29 traps 


* 


12 


* IN THIS SECTION, CONSTANTS THAT CANNOT BE REPRESENTED 

* IN THE SHORT FORMAT ARE INITIAIIZED. 

. data 


MASK 

. word 

BLKO 

.word 

BLKO 

.word 

STCK 

.word 

CTRL 

.word 

DMACTL 

.word 

TIMOCTL 

.word 

TIMICTL 

.word 

SERGLOBO 

.word 

SERPRTXO 

.word 

SERPRTRO 

.word 

SERTIMO 

.word 

SERGLOBl 

.word 

SERPRTXl 

.word 

SERPRTRl 

.word 

SERTIMl 

.word 

PARINT 

.word 

lOINT 

. word 


OFFFFFFFFH 

0809800H 

0809C00H 

0809F00H 

0808000H 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 


Beginning address of RAM block 0 
Beginning address of RAM block 1 
Beginning of stack 

Pointer for peripheral-bus memory map 
Initialization for DMA control (0) 
Initialization of timer 0 control (32) 
Initialization of timer 1 control (48) 


Init of serial 
Init of serial 


0 glbl control (64) 

0 xmt port control (66) 
0 rev port control (67) 
Init of serial 0 timer control (68) 
Init of serial 1 glbl control (80) 

1 xmt port control (82) 
1 rev port control (83) 
1 timer control (84) 
Init parallel interface control (96) 
Init I/O interface control (100) 


Init of serial 


Init of serial 
Init of serial 
Init of serial 
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. text 

* 

* THE ADDRESS AT MEMORY LOCATION 0 DIRECTS EXECUTION TO BEGIN HERE 

* FOR RESET PROCESSING THAT INITIALIZES THE PROCESSOR. WHEN RESET 

* IS APPLIED, THE FOLLOWING REGISTERS ARE INITIALIZED TO ZERO: 

* 


* ST-CPU STATUS REGISTER 

* IE - CPU/DMA INTERRUPT ENABLE FLAGS 

* IF-CPU INTERRUPT FLAGS 

* lOF-I/O FLAGS 

* 


* THE STATUS REGISTER HAS THE FOLLOWING ARRANGEMENT: 

* BITS: 31-14 13 12 11 10 9 87 6 543210 

* FUNCTION: RESRV GIE CC CE CF RES RM OVM LUF LV UF N Z V C 

* 


INIT 

LDP 

0,DP 


LDI 

1800H,ST 

* 

LDI 

@MASK,IE 


; Point the DP register to page 0 
; Clear and enable cache, and disable OVM 
; Unmask all interrupts 


INTERNAL DATA MEMORY INITIALIZATION TO FLOATING POINT ZERO 



LDI 

@BLK0,AR0 


ARO points to block 0 


LDI 

@BLK1,AR1 


ARl points to block 1 


LDF 

0.0, RO 


Zero register RO 


RPTS 

1023 


Repeat 1024 times .. . 


STF 

R0,*AR0++(1) 


Zero out location in RAM block 0 

1 1 

■k 

STF 

R0,*AR1++(1) 


zero out location in RAM block 1 

* 

THE PROCESSOR IS INITIALIZED 

THE REMAINING APPLICATION- 

* 

DEPENDENT 

PART OF THE SYSTEM 

(BOTH ON- AND OFF-CHIP SHOULD 


and 


NOW BE INITIALIZED. 


* FIRST, INITIALIZE THE CONTROL REGISTERS. IN THIS EXAMPLE, 

* EVERYTHING IS INITIALIZED TO ZERO SINCE THE ACTUAL INITIAL- 

* IZATION IS APPLICATION DEPENDENT. 


LDI 

@CTRL,AR0 

; LOAD 

in ARO 

the pointer to control 



; registers 



LDI 

@DMACTL,RO 





STI 

R0,*+AR0(0) 

; Init 

DMA control 

LDI 

@TIM0CTL,R0 





STI 

R0,*+AR0(32) 

; Init 

timer 

0 

control 

LDI 

@TIM1CTL,R0 





STI 

R0,*+AR0(48) 

; Init 

timer 

1 

control 

LDI 

@SERGLOBO,RO 





STI 

R0,*+AR0(64) 

; Init 

serial 

0 

global control 

LDI 

@SERPRTX0,R0 





STI 

R0,*+AR0(66) 

; Init 

serial 

0 

xmt control 

LDI 

@SERPRTR0,R0 





STI 

R0,*+AR0(67) 

; Init 

serial 

0 

rev control 

LDI 

@SERTIM0,R0 





STI 

R0,*+AR0(68) 

; Init 

serial 

0 

timer control 

LDI 

@SERGLOBL,RO 





STI 

R0,*+AR0(80) 

; Init 

serial 

1 

global control 

LDI 

@SERPRTX1,R0 





STI 

R0,*+AR0(82) 

; Init 

serial 

1 

xmt control 

LDI 

@SERPRTR1,R0 





STI 

R0,*+AR0(83) 

; Init 

serial 

1 

rev control 
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LDI 

@SERTIM1,R0 



STI 

R0,*+AR0(84) 


Init serial timer control 

LDI 

@PARINT,RO 



STI 

R0,*+AR0(96) 


Init parallel interface control 

LDI 

@IOINT,RO 


STI 

R0,*+AR0(100) 


Init I/O interface control 

LDI 

@STCK,SP 


Initialize the stack pointer 

OR 

2000H,ST 


Global interrupt enable 

BR 

BEGIN 


Branch to the beginning of application 

. end 
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12.2 Program Control 

To facilitate the TMS320C30's use in general-purpose, high-speed process¬ 
ing, a variety of instructions are provided to handle the; 

• subroutine calls 

• software stack 

• interrupts 

• zero-overhead branches 

• single- and multiple-instruction loops without any overhead. 

This section describes how to use these features of the TMS320C30. 

12.2.1 Subroutines 

The TMS320C30 has a 24-bit program counter (PC) and a practically unlim¬ 
ited software stack. The CALL and CALLcond subroutine calls cause the stack 
pointer to Increment, and store the contents of the next value of the PC 
counter on the stack. At the end of the subroutine, RETScond performs a 
conditional return. 

Example 12-2 illustrates the use of a subroutine to determine the dot product 
between two vectors. Given two vectors of length N, represented by the ar¬ 
rays a[0], a[1 ],..., a[N-1] and b[0], b[1 ],..., b[N-1], the dot product Is com¬ 
puted from the expression 

d = a[0] b[0] + a[1] b[1] + ... + a[N-1] b[N-1] 

Processing proceeds In the main routine to the point where the dot product 
Is to be computed. It is assumed that the arguments of the subroutine have 
been appropriately Initialized. At this point, a CALL is made to the subroutine, 
transferring control to that section of the program memory for execution, then 
returning to the calling routine via the RETS instruction when execution has 
completed. Note for this particular example, it would suffice to save the reg¬ 
ister R2. However, a larger number of registers are saved for demonstration 
purposes. The saved registers are stored on the system stack. This stack 
should be large enough to accommodate the maximum anticipated storage 
requirements. Besides this way of saving registers, any other method could 
be used equally well. 
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Example 12-2. Subroutine Call (Dot Product) 

* 

* TITL SUBROUTINE CALL (DOT PRODUCT) 

* 

* 

* MAIN ROUTINE THAT CALLS THE SUBROUTINE * DOT' TO COMPUTE THE 

* DOT PRODUCT OF TWO VECTORS. 


LDI 

LDI 

LDI 

@blkO,ARO 

@blkl,ARl 

N,RC 

; ARO points to vector a 
; ARl points to vector b 
; RC contains the number of elements 

CALL 

DOT 



* SUBROUTINE DOT 

★ 


* EQUATION: d = a(0) * b(0) + a(l) * b(l) + ... + a(N“l) * b(N-l) 

* 

* THE DOT PRODUCT OF a AND b IS PLACED IN REGISTER RO. N MUST 

* BE GREATER THAN OR EQUAL TO 2. 

* 


* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* ARO I ADDRESS OF a(0) 

* ARl I ADDRESS OF b(0) 

* RC I LENGTH OF VECTORS (N) 

* 


* REGISTERS USED AS INPUT: ARO, ARl, RC 

* REGISTER MODIFIED: RO 

* REGISTER CONTAINING RESULT: RO 

* 

* 

* 

.global DOT 

* 


DOT 


* 
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PUSHF 

R2 

PUSH 

R2 

PUSH 

ST 

PUSH 

ARO 

PUSH 

ARl 

PUSH 

RC 

MPYF3 

*AR0,*AR1,R0 

LDF 

0.0,R2 

SUBI 

2,RC 


Use the stack to save R2’S 
bottom 32 and top 32 bits 
Save status register 
Save ARO 
Save A-Rl 
Save RC 
Initialize RO: 
a(0) * b(0) -> RO 
Initialize R2. 

Set RC = N-2 
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* DOT PRODUCT (1 <= i < N) 


RPTS 

RC 

Setup the repeat single. 

MPYF3 

*-t-+ARO(l) ,* ++ARl(l) ,R0 ; a{i) 

* b(i) -> RO 

ADDF3 

R0,R2,R2 

a{i-1)*b(i~l) + 

R2 -> R2 

ADDF3 

o 

CN 

o 

a(N-l)*b(N~l) + 

R2 -> RO 

RETURN SEQUENCE 



POP 

RC 

Restore RC 


POP 

ARl 

Restore ARl 


POP 

ARO 

Restore ARO 


POP 

ST 

Restore ST 


POPF 

R2 

Restore top 32 

bits of R2 

POP 

R2 

Restore bottom 

32 bits of R2 

RETS 


Return 



* end 

* 

. end 


12.2.2 Software Stack 

The TMS320C30 has a software stack whose location is determined by the 
contents of the stack pointer register SP. The stack pointer increments from 
low to high values, and provisions should be made to accommodate the an¬ 
ticipated storage requirements. The stack can be used not only during the 
subroutine CALL and RETS, but also inside the subroutine as a place of tem¬ 
porary storage of the registers as shown in Example 12-2. SP always points 
to the last value pushed on the stack. 


The CALL and CALLcond instructions push the value of the program counter 
on the stack, as do the interrupt routines. Then, RETScond and RETIcond pop 
the stack and place the value in the program counter. The integer value of any 
register can be pushed on and popped off the stack using the PUSH and POP 
instructions. There are two additional Instructions, PUSHF and POPF, for 
floating point numbers. These instructions can be used to pop and push 
floating point numbers to registers R0-R7. This feature is very useful if It is 
desired to save the extended precision registers (see Example 12-2). By using 
PUSH and PUSHF on the same register, the lower 32 and the upper 32 bits 
are saved. PUSH saves the lower 32; PUSHF, the upper 32. To recover this 
extended precision number, a POPF can be done followed by POP. It Is Im¬ 
portant to do the integer and floating-point PUSH and POP in the above or¬ 
der. POPF forces the last eight bits of the extended-precision registers to zero. 
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The stack pointer (SP) can be both read from, and written to. Multiple stacks 
for different program segments may be easily created. SP is not initialized by 
the hardware during reset. It is therefore important to remember to initialize 
its value so that SP points to a predetermined memory location. This avoids 
the problem of SP attempting to write Into ROM or over other useful data. 
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12.2.3 Interrupt Service Routines 

Interrupts on the TMS320C30 are prioritized and vectored. When an interrupt 
occurs, the corresponding flag is set in the Interrupt Flag Register IF. If the 
corresponding bit in the Interrupt Enable Register IE Is set, and interrupts are 
enabled by having the GIE bit in the status register set to 1, interrupt proc¬ 
essing begins. The interrupt flag register can also be written to. This enables 
the user to force an Interrupt by software, or to clear interrupts without proc¬ 
essing them. 

The Interrupt Flag Register IF can be read, and action taken based on whether 
the interrupt has occurred. This is true even when the interrupt is disabled. 
This can be useful when an interrupt-driven interface is not implemented. Ex¬ 
ample 12-3 shows the case where a subroutine is called when interrupt 1 has 
not occurred. 

Example 12-3. Use of Interrupts for Software Polling 

* TITL INTERRUPT POLLING 


TSTB 2,IF ; Test if interrupt 1 has occurred 

CALLZ SUBROUTINE ; If not, call subroutine 


12 


When Interrupt processing begins, the program counter is pushed on the 
stack, and the interrupt vector is loaded in the program counter. Interrupts are 
then disabled by setting the GIE=0, and the program continues from the ad¬ 
dress loaded in the program counter. Since all Interrupts are disabled. Inter¬ 
rupt processing may proceed without further interruption, unless the interrupt 
service routine re-enables interrupts. 

Except for very simple interrupt service routines. It is Important to assure that 
the processor context is saved during execution of this routine. The context 
must be saved before executing the routine Itself, and restored after the routine 
is finished. The procedure Is called context switching. Context switching Is 
also useful for subroutine calls, especially when extensive use is made of the 
auxiliary and the extended precision registers. Code examples of context 
switching and an interrupt service routine are provided In this section. 
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12.2.3.1 Context Switching 

Context switching is commonly required when processing a subroutine call 
or interrupt. It may be quite extensive or simple, depending on system re¬ 
quirements. On the TMS320C30, the program counter is automatically 
pushed on the stack. If there Is any Important Information in the other 
TMS320C30 registers, such as the status, auxiliary or extended-precision re¬ 
gisters, these must be saved by special commands. 

Examples 12-4 and 12-5 show saving and restoring of the TMS320C30 state. 
In both examples, the stack Is used for saving the registers, and it expands 
towards higher addresses. If it is not desirable to use the stack pointed at by 
SP, a separate stack can be created using an auxiliary register as the stack 
pointer. The registers saved are: 

• Extended-precision registers RO through R7 

• Auxiliary registers ARO through ART 

• Data page pointer DP 

• Index registers IRQ and IR1 

• Block size register BK 

• Status register ST 

• Interrupt-related registers IE and IF 

• I/O flag lOF 

• Repeat-related registers RS, RE, and RC 
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Example 12-4. Context Save For The TMS320C30 


* 

* 

* 

* 

* 

* 


TITL CONTEXT-SAVE FOR THE TMS320C30 

.global SAVE 

CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT. 


SAVE: 


* 



* 

SAVE THE EXTENDED PRECISION 

* 




PUSH 

RO ; 


PUSHF 

RO ; 


PUSH 

R1 ; 


PUSHF 

R1 


PUSH 

R2 ; 


PUSHF 

R2 ; 


PUSH 

R3 ; 


PUSHF 

R3 ; 


PUSH 

R4 ; 


PUSHF 

R4 ; 


PUSH 

R5 ; 


PUSHF 

R5 ; 


PUSH 

R6 ; 


PUSHF 

R6 ; 


PUSH 

R7 ; 


PUSHF 

R7 ; 

* 



* 

SAVE THE AUXILIARY REGISTER 

* 




PUSH 

ARO ; 


PUSH 

ARl ; 


PUSH 

AR2 ; 


PUSH 

AR3 ; 


PUSH 

AR4 ; 


PUSH 

AR5 ; 


PUSH 

AR6 ; 


PUSH 

AR7 ; 

* 



* 

SAVE THE REST REGISTERS FRO 

* 




PUSH 

DP 


PUSH 

IRO ; 


PUSH 

IRl ; 


PUSH 

BK 


PUSH 

ST ; 


PUSH 

IE ; 


PUSH 

IF ; 


PUSH 

lOF ; 


PUSH 

RS 


PUSH 

RE ; 


PUSH 

RC ; 

* 



* 

SAVE IS COMPLETE 

★ 




REGISTERS 

Save the lower 32 bits of RO 
and the upper 32 bits 
Save the lower 32 bits of R1 
and the upper 32 bits 
Save the lower 32 bits of R2 
and the upper 32 bits 
Save the lower 32 bits of R3 
and the upper 32 bits 
Save the lower 32 bits of R4 
and the upper 32 bits 
Save the lower 32 bits of R5 
and the upper 32 bits 
Save the lower 32 bits of R6 
and the upper 32 bits 
Save the lower 32 bits of R7 
and the upper 32 bits 


Save ARO 
Save ARl 
Save AR2 
Save AR3 
Save AR4 
Save AR5 
Save AR6 
Save AR7 

THE REGISTER FILE 

Save data page pointer 
Save index register IRQ 
Save index register IRl 
Save block-size register 
Save status register 
Save interrupt enable register 
Save interrupt flag register 
Save I/O flag register 
Save repeat start address 
Save repeat end address 
Save repeat counter 
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Example 12-5. Context-Restore For The TMS320C30 

* 

* TITL CONTEXT-RESTORE FOR THE TMS320C30 

* 

* 


■k 

.GLOBAL 

RESTR 



* 

k 

CONTEXT RESTORE AT THE END 

OF A SUBROUTINE CALL OR INTERRUPT. 

RESTR: 

k 




k 

k 

RESTORE THE 

REST REGISTERS 

FROM THE 

REGISTER FILE 


POP 

RC 

Restore 

repeat counter 


POP 

RE 

Restore 

repeat end address 


POP 

RS 

Restore 

repeat start address 


POP 

lOF 

Restore 

I/O flag register 


POP 

IF 

Restore 

interrupt flag register 


POP 

IE 

Restore 

interrupt enable register 


POP 

ST 

Restore 

status register 


POP 

BK 

Restore 

block-size register 


POP 

IRl 

Restore 

index register IRl 


POP 

IRO 

Restore 

index register IRO 

k 

POP 

DP 

Restore 

data page pointer 

k 

k 

RESTORE THE 

AUXILIARY REGISTERS 



POP 

AR7 

Restore 

AR7 


POP 

AR6 

Restore 

AR6 


POP 

AR5 

Restore 

AR5 


POP 

AR4 

Restore 

AR4 


POP 

AR3 

Restore 

AR3 


POP 

AR2 

Restore 

AR2 


POP 

ARl 

Restore 

ARl 

k 

POP 

ARO 

Restore 

ARO 

k 

k 

RESTORE THE 

EXTENDED PRECISION REGISTERS 


POPF 

R7 

Restore 

the upper 32 bits and 


POP 

R7 

the 

lower 32 bits of R7 


POPF 

R6 

Restore 

the upper 32 bits and 


POP 

R6 

the 

lower 32 bits of R6 


POPF 

R5 

Restore 

the upper 32 bits and 


POP 

R5 

the 

lower 32 bits of R5 


POPF 

R4 

Restore 

the upper 32 bits and 


POP 

R4 

the 

lower 32 bits of R4 


POPF 

R3 

Restore 

the upper 32 bits and 


POP 

R3 

the 

lower 32 bits of R3 


POPF 

R2 

Restore 

the upper 32 bits and 


POP 

R2 

the lower 32 bits of R2 


POPF 

R1 

Restore 

the upper 32 bits and 


POP 

R1 

the 

lower 32 bits of R1 


POPF 

RO 

Restore 

the upper 32 bits and 


POP 

RO 

The 

lower 32 bits of RO 


* 


* RESTORE IS COMPLETE 

* 
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12.2.3.2 Interrupt Priority 

Interrupts on the TMS320C30 are automatically prioritized. This allows in¬ 
terrupts that occur simultaneously to be serviced in a predefined order. Infre¬ 
quent but lengthy, interrupt service routines may need to be interrupted by 
more frequently occurring interrupts. In Example 12-6, the interrupt service 
routine for INT2 temporarily modifies the interrupt enable register IE, to permit 
interrupt processing when an interrupt to INTO (but no other interrupt) occurs. 
When the routine has finished processing, the register IE is restored to its ori¬ 
ginal state. Notice that the RETI instruction not only pops the next program 
counter address from the stack, but also sets the GIE bit of the status register. 
This enables all Interrupts which have their interrupt-enable bit set. 

Example 12-6. Interrupt Service Routine 

* TITL INTERRUPT SERVICE ROUTINE 

* .global ISR2 

ENABLE .set 2000h 

MASK .set 1 

* 

* INTERRUPT PROCESSING FOR EXTERNAL INTERRUPT INT2- 

* 

ISR2: 

PUSH 
PUSH 
PUSH 
PUSH 
PUSHF 
PUSH 
PUSHF 
LDI 
OR 

* 

* MAIN PROCESSING SECTION FOR ISR2 


ST 

DP 

IE 

RO 

RO 

R1 

R1 

MASK,IE 
ENABLE,ST 


Save status register 
Save data page pointer 
Save interrupt enable register 
Save lower 32 bits and 
upper 32 bits of RO 
Save lower 32 bits and 
upper 32 bits of Rl 
Unmask only INTO 
Enable all interrupts 


XOR 

ENABLE,ST 

POPF 

Rl 

POP 

Rl 

POPF 

RO 

POP 

RO 

POP 

IE 

POP 

DP 

POP 

ST 

RETI 



Disable all interrupts 
Restore upper 32 bits and 
lower 32 bits of Rl 
Restore upper 32 bits and 
lower 32 bits of RO 
Restore interrupt enable register 
Restore data page register 
Restore status register 

Return and enable interrupts 
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12.2.4 Delayed branches 

The TMS320C30 offers the capability of single-cycle branching through the 
use of the delayed branches. The delay branches operate like regular branches 
but do not flush the pipeline. Instead, the three instructions following a de¬ 
layed branch are also executed. As discussed in the section on program-flow 
control, the only limitation is that the three instructions following a delayed 
branch cannot be a: 

• Branch (standard or delayed) 

• Call to a subroutine 

• Return from a subroutine 

• Return from an interrupt 

• Repeat instructions 

• A TRAP instruction 

• An IDLE instruction 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. Sometimes, a branch 
is necessary in the flow of a program, but less than three instructions can be 
placed after a delayed branch. For faster execution, it is still advantageous to 
use a delayed branch. This is shown in Example 12-7, with NOP's taking the 
place of the unused Instructions. The trade-off is more Instruction words for 
less execution time. 

Example 12-7. Delayed Branch Execution 

* TITL DELAYED BRANCH EXECUTION 


* 


SKIP 


LDF *+AR1(5),R2 

BGED SKIP 

LDFN R2,R1 

SUBF 3.0,R1 

NOP 

MPYF 1.5,R1 


LDF R1,R3 


Load contents of memory to R2 
If loaded number >=0, branch (delayed) 
If loaded number <0, load it to R1 
Subtract 3 from R1 

Dummy operation to complete delayed 
branch 

Continue here if loaded number <0 


12 


Continue here if 


loaded number >=0 
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12.2.5 Repeat Modes 

The TMS320C30 supports looping without any overhead. For that purpose, 
there are two instructions: RPTB repeats a block of code, and RPTS repeats a 
single instruction. There are three control registers RS (repeat start address), 
RE (repeat end address), and RC (repeat counter). These contain the param¬ 
eters that specify loop execution (refer to Section 7.1 for a complete de¬ 
scription of RPTB and RPTS). RS and RE are automatically set from the code, 
while RC has to be set by the user, as shown In the examples below. 

12.2.5.1 Block Repeat 

Example 12-8 shows an application of the block repeat construct. In this ex¬ 
ample, an array of 64 elements is "flipped over" by exchanging the elements 
that are equidistant from the end of the array. In other words, if the original 
array is: 

a(1), a(2).a(31), a(32),..., a(64); 

the final array after the rearrangement will be: 
a(64), a(63),..., a(32), a(31).a(1). 

Note that since the exchange operation Is done on two elements at the same 
time, there is a need of 32 operations. The repeat counter RC Is initialized to 
31. In general. If RC contains the number N, the loop will be executed N + 1 
times. The loop is defined by the RPTB instruction and the EXCH label. 

Example 12-8. Loop Using Block Repeat 

* TITL LOOP USING BLOCK REPEAT 

* 

* THIS CODE SEGMENT EXCHANGES THE VALUES OF ARRAY ELEMENTS THAT ARE 

* SYMMETRIC AROUND THE MIDDLE OF THE ARRAY. 



LDI 

@ADDR,ARO 


ARO points to the beginning of 


LDI 

AR0,AR1 




ADD I 

63,AR1 

} 

ARl points to the end of the 

* 



} 

64-element array 

•k 

LDI 

31,RC 

} 

Initialize repeat counter 


RPTB 

EXCH 

} 

Repeat RC+1 times between here 

k 




EXCH 


LDI 

*ARO,RO 

/ 

Load one memory element in RO, 

1 1 

LDI 

*AR1,R1 

• 

and the other in R1 

EXCH 

STI 

R1,*AR0++(1) 

/ 

Then, exchange their locations 

1 1 

STI 

R0,*AR1—(1) 



Section 7.1.2 specifies restrictions in the block-repeat construct. The basic 
rule is, since the program counter is modified at the end of the loop according 
to the contents of the registers RS, RE, and RC, no operation should attempt 
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to modify the repeat counter or the program counter at the end of the loop in 
a different way. 

In principle, it is possible to nest repeat blocks. However, there is only one 
set of control registers RS, RE, and RC. It is therefore necessary to save these 
registers before entering an inside loop. It may be more economical to im¬ 
plement a nested loop by the more traditional method of a register serving as 
a counter and then using a delayed branch rather than applying the above 
approach. 

Example 12-9 shows another example of using the block repeat to find a 
maximum of 147 numbers. 

Example 12-9. Use of Block Repeat to Find a Maximum 

* 

* 

* TITL USE OF BLOCK REPEAT TO FIND A MAXIMUM 

* 

* THIS ROUTINE FINDS THE MAXIMUM OF N=147 NUMBERS 

* 


LDI 

LDI 

LDF 

* 

RPTB 

CMPF 

LOOP LDFLT 


146,RC ; Initialize repeat counter to 147-1 

@ADDR,ARO ; ARO points to the beginning of the array 
*AR0++(1),R0 ; Initialize MAX to the first value 

LOOP 

*AR0++(1),R0 ; Compare number to the maximum 
*-ARO(l),RO ; If greater, this is a new maximum 


12.2.5.2 Single-Instruction Repeat 

The single instruction repeat operates using the control registers RS, RE, and 
RC, in the same way as the block repeat. The advantage over the block repeat 
is that the instruction Is fetched only once, and then the buses are available 
for moving operands. One difference to note Is that the single-instruction re¬ 
peat construct Is not Interruptible, while block repeat Is interruptible. 

Example 12-10 shows an application of the repeat-single construct. In this 
example, the sum of the products of two arrays Is computed. The arrays are 
not necessarily different. If the arrays are a(i) and b(i), each of length N=512, 
register RO will contain, after computation, the quantity: 

a(1)b(1)+a(2)b(2) + ...+a(N)b(N). 

The value of the repeat counter (RC) is specified to be 511 in the instruction. 
If RC contains the number N, the loop will be executed N+l times. 
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Example 12-10. 


* TITL LOOP USING SINGLE REPEAT 

* 

* N 

* THIS CODE SEGMENT COMPUTES X a(i)b(i) 

* i=l 

* 


LDI 

LDI 

* 

LDF 

* 

MPYF3 

* 

RPTS 

•k 

MPYF3 
I I ADDF3 

* 

ADDF 


12.2.6 Computed GOTO's 

Occasionally, it is convenient to select during runtime, and not during assem¬ 
bly, what subroutine needs to be executed. The TMS320C30 offers the ca¬ 
pability of a computed GOTO that can satisfy such a need. The computed 
GOTO is implemented using the CALLcond instruction in the register ad¬ 
dressing mode. This instruction uses the contents of the register as the ad¬ 
dress of the call. Example 12-11 shows the case of a task controller. 


@ADDR1,AR0 ; ARO points to array a(i) 

@ADDR2,AR1 ; ARl points to array b(i) 

0.0,RO ; Initialize RO 

*AR0++(1),*AR1++(1),R1 

; Compute first product 
511 ; Repeat 512 times 

*AR0++(1),*AR1++{1),R1,R0 ; Compute next product 

R1,R0,R0 ; and accumulate the previous one 

R1,R0 ; One final addition 


12 
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Example 12-11. Computed GOTO 

* TITL COMPUTED GOTO 

* 

* TASK CONTROLLER 

* 

* THIS MAIN ROUTINE CONTROLS THE ORDER OF TASK EXECUTION(6 TASKS 

* IN THE PRESENT EXAMPLE). TASKO THROUGH TASKS ARE THE NAMES OF 

* SUBROUTINES TO BE CALLED. THEY ARE EXECUTED IN ORDER, TASKO, 

* TASKl, . . .TASKS. WHEN AN INTERRUPT OCCURS, THE INTERRUPT 

* SERVICE ROUTINE IS EXECUTED, AND THE PROCESSOR CONTINUES 

* WITH THE INSTRUCTION FOLLOWING THE IDLE INSTRUCTION. THIS 

* ROUTINE SELECTS THE TASK APPROPRIATE FOR THE CURRENT CYCLE, 

* CALLS THE TASK AS A SUBROUTINE, AND BRANCHES BACK TO THE IDLE 

* TO WAIT FOR THE NEXT SAMPLE INTERRUPT WHEN THE SCHEDULED TASK 

* HAS COMPLETED EXECUTION. RO HOLDS THE OFFSET FROM THE BASE 

* ADDRESS OF THE TASK TO BE EXECUTED. 

* 


WAIT 

* 


* 

TSKSEQ 


ADDR 


LDI 

5,R0 

LDI 

@ADDR,ARl 

IDLE 
ADD I 

*AR1,R0,R1 

SUBI 

1,R0 

LDILT 

5,R0 

CALLU 

R1 

BR 

WAIT 

.word 

TASKS 

.word 

TASK4 

.word 

TASKS 

.word 

TASK2 

.word 

TASKl 

.word 

TASKO 

.WORD 

TSKSEQ 


Initialize RO 

ARl holds the base address of the table 
Wait for the next interrupt 
Add the base address to the table 
Entry number 
Decrement RO 

If R0<0, reinitialize it to 5 
Execute appropriate task 


Address of TASKS 
Address of TASK4 
Address of TASKS 
Address of TASK2 
Address of TASKl 
Address of TASKO 
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12.3 Logical and Arithmetic Operations 

The TMS320C30 instruction set supports both integer and floating point 
arithmetic and logical operations. The basic functions of such instructions can 
be combined to form more complex operations. This section examines exam¬ 
ples of such operations, such as: 

• Bit manipulation 

• Block moves 

• Bit-reversed addressing 

• Integer and floating-point division 

• Square root 

• Extended precision arithmetic 

• Floating point format conversion between IEEE and TMS320C30 

formats. 

12.3.1 Bit Manipulation 

The instructions of the TMS320C30 for the usual logical operations, such as 
AND, OR, NOT, ANDN, and XOR, can be used together with the shift in¬ 
structions for bit manipulation. In addition to these instructions, there is a 
special instruction, TSTB, for testing bits. TSTB does the same operation as 
AND, but the result of the logical AND Is not written anywhere and is only 
used to set the condition flags. Examples 12-12 and 12-13 demonstrate the 
use of the several instructions for bit manipulation and testing. 

Example 12-12. Use of TSTB for Software-Controlled Interrupt 

* TITL USE OF TSTB FOR SOFTWARE-CONTROLLED INTERRUPT 

* 

* IN THIS EXAMPLE, ALL INTERRUPTS HAVE BEEN DISABLED BY 

* RESETTING THE GIE BIT OF THE STATUS REGISTER. WHEN AN 

* INTERRUPT ARRIVES, IT IS STORED IN THE IF REGISTER. THE 

* PRESENT EXAMPLE ACTIVATES THE INTERRUPT SERVICE ROUTINE INTR 

* WHEN IT DETECTS THAT INT2- HAS OCCURRED. 


12 


TSTB 4,IF ; Check if bit 2 of IF is set, 

CALLNZ INTR ; and, if so, call subroutine INTR 
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Example 12-13. Copy a Bit From One Location to Another 

* TITL COPY A BIT FROM ONE LOCATION TO ANOTHER 


* BIT I OF R1 NEEDS TO BE COPIED TO BIT J OF R2. 

* ARO POINTS TO A LOCATION HOLDING 1, AND IT IS ASSUMED THAT THE 

* NEXT MEMORY LOCATION HOLDS THE VALUE J. 

* 

* I 

* i 


R1 


★ 

* 


* 


* 


* 


* 


* 


* 


J 

I 



R2 


*ARO 


*(AR0+1) 


LDI 

1,R0 



LSH 

*ARO,RO 

Shift 1 

to align it with bit I 

TSTB 

R1,R0 

Test the 

I-th bit of R1 

BZD 

CONT 

If bit = 

0, branch delayed 

LDI 

1,R0 



LSH 

*+ARO(l),R0 

Align 1 

with J-th location 

ANDN 

R0,R2 

If bit = 

0, reset J-th bit of R2 

OR 

R0,R2 

If bit = 

1, set J-th bit of R2 


CONT 
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12.3.2 Block Moves 

Since the TMS320C30 directly addresses a large amount of memory, blocks 
of data or program code can be stored off-chip in slow memories and then 
loaded on-chip for faster execution. Data can also be moved from on-chip to 
off-chip for storage or for multiprocessor data trasfers. 

Such data transfers can be accomplished very efficiently In parallel with CPU 
operations, using the DMA. The DMA operation Is explained In detail in an 
earlier section of this manual. An alternative to DMA is to perform data 
transfers under program control using load and store instructions in a repeat 
mode. Example 12-14 shows the case where a block of 512 floating-point 
numbers are transferred from external memory to block 1 of the on-chip RAM. 

Example 12-14. Block Move Under Program Control 


* TITL BLOCK MOVE UNDER PROGRAM CONTROL 

* 


extern 

.word 

OlOOOH 



blockl 

.word 

0809C00H 




LDI 

@extern,ARO 

/ 

Source address 


LDI 

@blockl,ARl 

} 

Destination address 


LDF 

*ARO++,RO 

} 

Load the first number 


RPTS 

510 

/ 

Repeat following instruction 


LDF 

*AR0++,R0 

} 

Load the next number, and... 

1 1 

STF 

RO,*AR1++ 

! 

store the previous one 


STF 

R0,*AR1 

/ 

Store the last number 


12.3.3 Bit-Reversed Addressing 


12 


For an efficient implementation of Fast Fourier Transforms (FFT), the 
TMS320C30 offers the capability of bit-reversed addressing. If the data to 
be transformed Is In the correct order, the final result of the FFT is scrambled 
(In bit-reversed order). To recover the frequency-domain data in the correct 
order, certain memory locations have to be swapped. The bit-reversed ad¬ 
dressing mode offers the alternative of not doing this swapping. The next time 
data needs to be accessed, the access is done in a bit-reversed manner rather 
than sequentially. 


In bit-reversed addressing, IRO holds a value equal to one-half the size of the 
FFT, if real and Imaginary data are stored in separate arrays. During accessing, 
the auxiliary register is indexed by IRO, but with reverse carry propagation. 
Example 12-15 illustrates a 512-point complex FFT being moved from the 
place of computation (pointed at by ARO) to a location pointed at by AR1. 
In this example, real and Imaginary parts XR(i) and Xl(l) of the data are not 
stored in separate arrays, but they are interleaved XR(0),XI(0),XR(1 ),XI(1),..., 
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XR(N-1 ),XI(N-1). Because of this arrangement the length of the array is 2N 
instead of N, and IRO is set to 512 instead of 256. 

Example 12-15. Bit-Reversed Addressing 

* 

* TITL BIT-REVERSED ADDRESSING 

* 

* THIS EXAMPLE MOVES THE RESULT OF THE 512-POINT FFT 

* COMPUTATION, POINTED AT BY ARO, TO A LOCATION POINTED AT 

* BY ARl. REAL AND IMAGINARY POINTS ARE ALTERNATING. 


LDI 

LDI 

LDI 

LDF 

RPTB 


LDF 

I j STF 

* 

LOOP LDF 

I J STF 


512,IRO 
2,IR1 

511,RC ; Repeat 511+1 times 

*+ARO(l),Rl ; Load first imaginary point 
LOOP 

*ARO++(IRO)B,RO ; Load real value (and point 
R1,*+AR1(1) ; to next location) and store 

; the imaginary value 

*+ARO(l),Rl ; Load next imaginary point and store 
RO,*AR1++(IRl) ; previous real value 


12.3.4 integer and Floating-point Division 

Although division is not implemented as a single instruction in the 
TMS320C30, the instruction set provides the necessary capabilities for an ef¬ 
ficient division routine. Integer and floating-point division will be examined 
separately because different algorithms are used. 


12.3A. 1 Integer Division 


Division is implemented on the TMS320C30 by repeated subtractions using 
SUBC, a special conditional subtract Instruction. Consider the case of a 32-bit 
positive dividend with i significant bits (and 32-i sign bits), and a 32-blt po¬ 
sitive divisor with j significant bits (and 32-j sign bits). The repetition of the 
SUBC command i-j + 1 times produces a 32 bit result where the lower i-j+1 
bits are the quotient, and the upper 31 -l+j bits, the remainder of the division. 


SUBC Implements binary division in the same manner as In long division. The 
divisor (assumed to be smaller than the dividend) is shifted left i-j times to be 
aligned with the dividend. Then, using SUBC, the shifted divisor is subtracted 
from the dividend. For each subtract that does not produce a negative answer, 
the dividend is replaced by the difference. It Is then shifted to the left, and a 
one is put in the LSB. If the difference Is negative, the dividend Is simply 
shifted left by one. This operation is repeated I-j+1 times. 


12 


As an example, consider the division of 33 by 5 using both long division and 
the SUBC method. In this case, i=6, j=3, and the SUBC operation is repeated 
6-3+1 =4 times. 
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LONG DIVISION: 


I 00000000000000000000000000000110 

00000000000000000000000000000101 I 00000000000000000000000000100001 

-101 

1101 

-101 

11 


Quotient 


Remainder 


SUBC METHOD: 

00000000000000000000000000100001 
00000000000000000000000000101000 

Negative difference 


00000000000000000000000001000010 
00000000000000000000000000101000 

00000000000000000000000000011010 

I 

00000000000000000000000000110101 
00000000000000000000000000101000 

00000000000000000000000000001101 

I 

00000000000000000000000000011011 
00000000000000000000000000101000 

Negative difference 


00000000000000000000000000110110 I 

--rri 

Remainder Quot. 


Dividend 
Divisor (aligned) 

(1st SUBC command) 

New Dividend + Quotient 
Divisor 

Difference (>0) (2nd SUBC command) 

New Dividend + Quotient 
Divisor 

Difference (>0) (3rd SUBC command) 

New Dividend + Quotient 
Divisor 

(4th SUBC command) 

Final Result 


When using the SUBC command, both the dividend and the divisor must be 
positive. Example 12-16 shows a realization of the integer division^ where the 
sign of the quotient is properly handled. The last instruction before returning 
modifies the condition flag, in case subsequent operations depend on the sign 
of the result. 


12 
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Example 12-16. Integer Division 




* 

* TITL INTEGER DIVISION 

* 




* 

SUBROUTINE 

DIVI 




* 

* 

INPUTS: 


SIGNED INTEGER 

DIVIDEND IN RO, 

* 

•k 



SIGNED INTEGER 

DIVISOR IN 

Rl. 

■k 

■k 

OUTPUT: 


RO/Rl into RO. 




k 

k 

REGISTERS 

USED: 

R0-R3, IRQ, IRl 



k 

OPERATION: 


1. NORMALIZE DIVISOR WITH 

DIVIDEND 

k 



2. REPEAT SUBC 




* 

★ 



3. QUOTIENT IS 

IN 

LSBs OF 

RESULT 

* 

ie 

CYCLES: 


31-62 (DEPENDS 

ON 

AMOUNT OF NORMALIZATION) 


.globl 

DIVI 





SIGN 

. set 

R2 





TEMPF 

. set 

R3 





TEMP 

. set 

IRQ 





COUNT 

. set 

IRl 





* DIVI “ SIGNED DIVISION 





DIVI: 

* 

* DETERMINE SIGN OF RESULT. GET ABSOLUTE VALUE OF OPERANDS. 


* 

XOR 

ABSI 

ABSI 

R0,R1,SIGN 

RO 

Rl 

; Get the 

sign 


CMP I 

R0,R1 

; Divisor 

> dividend ? 

* 

BHID 

ZERO 

; If so. 

return 0 


* NORMALIZE OPERANDS. USE DIFFERENCE IN EXPONENTS AS SHIFT COUNT 


* FOR DIVISOR, 

* 

AND AS REPEAT 

COUNT FOR 'SUBC'. 

FLOAT 

RO,TEMP 

Normalize dividend 

PUSHF 

TEMPF 

PUSH as float 

POP 

COUNT 

POP as int 

LSH 

-24,COUNT 

Get dividend exponent 

FLOAT 

Rl,TEMPF 

Normalize divisor 

12 

PUSHF 

TEMPF 

PUSH as float 

POP 

TEMP 

POP as int 

LSH 

-24,TEMP 

Get divisor exponent 

SUBI 

TEMP,COUNT 

Get difference in exponents 

LSH 

COUNT,Rl 

Align divisor with dividend 
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* DO COUNT+1 SUBTRACT & SHIFTS. 

* 

RPTS COUNT 
SUBC R1,R0 

* 

* MASK OFF THE LOWER COUNT+1 BITS OF RO 


SUBRI 

31,COUNT 

; Shift 

count is 

(32 

- (COUNT+1)) 

LSH 

COUNT,RO 

; Shift 

left 



NEGI 

COUNT 





LSH 

COUNT,RO 

; Shift 

right to 

get 

result 


* CHECK SIGN AND NEGATE RESULT IF NECESSARY. 

* 

NEGI 
ASH 
LDINZ 
CMP I 
RETS 

•k 

* RETURN ZERO. 

* 

ZERO: 

LDI 0,R0 

RETS 
. end 


R0,R1 ; Negate result 

-31,SIGN ; Check sign 

R1,R0 ; If set, use negative result 

0,R0 ; Set status from result 


If the dividend is less than the divisor and fractional division is desired, a di¬ 
vision can be performed after determining the desired accuracy of the quotient 
in bits. If the desired accuracy is k bits, start by shifting the dividend left by k 
positions. Then apply the algorithm described above, where i should now be 
replaced by i+k. It is assumed that i+k is less than 32. 


12.3.4.2 Computation of Floating-point inverse and Division 

This section presents a method of implementing floating-point division on the 
TMS320C30. Since the algorithm outlined here computes the inverse of a 
number v, to divide y/v, multiply y by the inverse of v. 

The computation of 1/v is based on the following iterative algorihm. At the 
i-th iteration, the estimate x[i] of 1/v is computed from v, and the previous 
estimate x[i-1] according to the formula: 

x[i] = x[i-1] * (2.0 - V* x[l-1]) 


12 


To start the operation, an initial estimate x[0] is needed. If v=a*2®, a good 
initial estimate Is: 

x[0] = 1.0 * Z-®-! 

Example 12-17 shows the implementation of this algorithm on the 
TMS320C30, where the iteration has been applied 5 times. The choice of the 
number of iterations was based on the desire to have maximum accuracy. The 
accuracy offered by the single-precision floating-point format is 
2-23 = 1 192E-7. |f more accuracy is desired, more iterations can be used. If 
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less accuracy is acceptable, the execution speed of this implementation can 
be increased by reducing the number of iterations. 

This algorithm properly treats the boundary conditions, when the Input num¬ 
ber is either zero or it has a very large value. When the Input is zero, the ex¬ 
ponent e=-128. Then the calculation of x[0] yields an exponent equal to 
-(-128)-1 =127 and the algorithm will overflow and saturate. On the other 
hand, in the case of a very large number, e=127, the exponent of x[0] will be 
-127-1 =-128. This will cause the algorithm to yield zero, which Is a reason¬ 
able handling of that boundary condition. 



Software Applications - Logical and Arithmetic Operations 


Example 12-17. Inverse of a Floating-Point Number 

* 

* TITL INVERSE OF A FLOATING-POINT NUMBER 

* 

* 

* SUBROUTINE INVF 

★ 

* 

* THE FLOATING-POINT NUMBER V IS STORED IN RO. AFTER THE 

* COMPUTATION IS COMPLETED, 1/v IS ALSO STORED IN RO. 

* 

* TYPICAL CALLING SEQUENCE: 

* LDF V, RO 

* CALL INVF 

* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* — — — — — — — — — — — — — — — — — — 

* RO I V = NUMBER TO FIND THE RECIPROCAL OF (UPON THE CALL) 

* RO I 1/v (UPON THE RETURN) 

* 


* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, R3 

* REGISTER CONTAINING RESULT: RO 


* 

* 

* 


CYCLES: 

35 

WORDS: 32 


* 

* 


.global 

INVF 




INVF: 

LDF 

R0,R3 

/ 

V is saved for later. 


* 


ABSF 

RO 

/ 

The algorithm uses v = 

= |v|. 

* 

•k 

EXTRACT THE 

EXPONENT 

OF V. 





PUSHF 

RO 






POP 

Rl 






ASH 

-24,R 

/ 

The 8 LSBs of Rl contain the exponent 

* 

■k 




} 

of V . 


* 

•k 

x[0] 

FORMATION GIVEN 

THE EXPONENT OF v. 




NEGI 

Rl 






SUBI 

1,R1 

/ 

Now we have -e-1, the 

exponent of x[0 



ASH 

24,Rl 






PUSH 

Rl 






POPF 

Rl 

} 

Now Rl = x[0] = 1.0 * 

2**(-e-l). 


* 


12 
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* 

* 


* 


* 


* 


* 


* 

* 

* 


★ 


★ 


* 


* 

* 


* 


* 

* 


* 


NOW 

THE ITERATIONS BEGIN. 




MPYF 

R1,R0,R2 

/ 

R2 = 

V * x[0] 


SUBRF 

2.0,R2 

/ 

R2 = 

= 2.0 - V * x[0] 


MPYF 

R2,R1 

} 

R1 = 

x[l] = x[0] * (2.0 - V * x[0]) 


MPYF 

R1,R0,R2 

} 

R2 = 

V * x[l] 


SUBRF 

2.0,R2 

} 

R2 = 

2.0 - V * x[l] 


MPYF 

R2,R1 

} 

R1 = 

x[2] = x[l] * (2.0 - V * x[l]) 


MPYF 

R1,R0,R2 

} 

R2 = 

V * x[2] 


SUBRF 

2.0,R2 

7 

R2 = 

2.0 - V * x[2] 


MPYF 

R2,R1 

7 

R1 = 

x[3] = x[2] * (2.0 - V * x[2]) 


MPYF 

R1,R0,R2 

7 

R2 = 

V * x[3] 


SUBRF 

2.0,R2 

7 

R2 = 

2.0 - V * x[3] 


MPYF 

R2,R1 

7 

R1 = 

x[4] = x[3] * (2.0 - V * x[3]) 


RND 

R1 

7 

This 

minimizes error in the LSBs. 

FOR 

THE LAST 

ITERATION 

WE USE THE 

FORMULATION: 

x[5] 

1 = {x[4] 

* (1.0 - 

(v * X 

[4]))) + x[4] 


MPYF 

R1,R0,R2 

7 

R2 = 

V * x[4] = 1.0..01.. => 1 


SUBRF 

1.0,R2 

7 

R2 = 

1.0 - V * x[4] = 0.0..01... => 0 


MPYF 

R1,R2 

7 

R2 = 

x[4] * (1.0 - V * x[4]) 


ADDF 

R2,R1 

7 

R2 = 

x[5] = (x[4]*(1.0-(v*x[4])))+x[4] 


RND 

R1,R0 

7 

Round since this is follow by a MPYF. 

NOW 

THE CASE 

OF V < 0 

IS HANDLED. 



NEGF 

R0,R2 





LDF 

R3,R3 

7 

This 

sets condition flags. 


LDFN 

R2,R0 


If V 

< 0, then RO = -RO 


RETS 






END 

. end 


12.3.5 Square Root 


The implementation of the square root on the TMS320C30 is done by an it¬ 
erative algorithm very similar to the one used for the computation of the in¬ 
verse. This algorithm computes the inverse of the square root of a number v, 
1/SQRT(v). To derive SQRT(v), multiply this result by v. Since in many ap¬ 
plications, division by the square root of a number is desirable, the output of 
the algorithm saves the effort to compute the inverse of the square root. 


12 


At the i-th iteration, the estimate x[i] of 1/SQRT(v) is computed from v and 
the previous estimate x[i-1] according to the formula: 


x[i] = x[i-1] * (1.5 - (v/2) * x[i-1] * x[i-1]) 

To start the operation, an initial estimate x[0] is needed. If v=a*2®, a good 
initial estimate is: 


x[0] = 1.0 * 2-®/2 
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Example 12-18 shows the implementation of this algorithm on the 
TMS320C30, where the iteration has been applied 5 times. The choice of the 
number of iterations was based on the desire to have maximum accuracy. If 
more accuracy is desired, more iterations can be used. If less accuracy is ac¬ 
ceptable, the execution speed of this implementation may be Increased by re¬ 
ducing the number of iterations. 
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Example 12-18. Square Root of a Floating-Point Number 

* 

* TITL SQUARE ROOT OF A FLOATING-POINT NUMBER 

* 

* 

* SUBROUTINE SQRT 

* 

* THE FLOATING POINT NUMBER v IS STORED IN RO. AFTER THE 

* COMPUTATION IS COMPLETED, SQRT(v) IS ALSO STORED IN RO. NOTE 

* THAT THE ALGORITHM ACTUALLY COMPUTES l/SQRT(v). 

* 
ic 

* TYPICAL CALLING SEQUENCE: 

* 

LDF V, RO 
CALL SQRT 
ARGUMENT ASSIGNMENTS: 

ARGUMENT | FUNCTION 

—-——— 

RO I V = NUMBER TO FIND THE SQUARE ROOT OF 

I (UPON THE CALL) 

RO I SQRT(v) (UPON THE RETURN) 

* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, R3 

* REGISTER CONTAINING RESULT: RO 

* 

* CYCLES: 39 WORDS: 33 

* 

.global SQRT 

★ 

* EXTRACT THE EXPONENT OF V. 

* 

SQRT: LDF R0,R3 

RETSLE 
PUSHF RO 

POP R1 

ASH -25,R1 

* 

* 

* X[0] FORMATION GIVEN 

* 

NEGI 
ASH 
PUSH 
POPF 

* 

* GENERATE V/2. 

* 

MPYF 0.5,R0 

* 

* NOW THE ITERATIONS BEGIN. 

* 

MPYF R1,R1,R2 
MPYF R0,R2 
SUBRF 1.5,R2 
MPYF R2,R1 

* 


R2 = x[0] * x[0] 

R2 = (v/2) * x[0] * x[0] 

R2 = 1.5 - (v/2) * x[0] * x[0] 
R1 = x[l] = x[0] * 

(1.5 - (v/2)*x[0]*x[0]) 


R1 

24,R1 
R1 

R1 ; Now R1 = x[0] = 1.0 * 2**(-e/2). 


; Save v 

; Return if number non-positive 


; The 8 LSBs of R1 contain 1/2 the exponent 
; of V. 

THE EXPONENT OF V. 


* 

* 

* 

* 

* 

* 

* 

* 

* 
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end 


MPYF 

R1,R1,R2 

/ 

R2 

= 

x[ll 

* x[l] 

MPYF 

R0,R2 

/ 

R2 

= 

(v/2) 

* x[l] * x[l] 

SUBRF 

1.5,R2 

7 

R2 

= 

1.5 - 

(v/2) * x[l] * x[ 

MPYF 

R2,R1 

7 

R1 

= 

x[2] 

= x[ll * 



7 


(1 

5 - (v/2)*x[l]*x[l]) 

MPYF 

R1,R1,R2 

; 

R2 

= 

x[2] 

* x[2] 

MPYF 

R0,R2 

7 

R2 

= 

(v/2) 

* x[2] * x[2] 

SUBRF 

1.5,R2 

7 

R2 

= 

1.5 - 

(v/2) * x[2] * x[ 

MPYF 

R2,R1 

; 

R1 


x[3] 

= x[2] 



7 


* 

(1.5 

- (v/2)*x[2]*x[2] ) 

MPYF 

R1,R1,R2 

7 

R2 

= 

x[3] 

* x[3] 

MPYF 

R0,R2 

; 

R2 

= 

(v/2) 

* x[3] * x[3] 

SUBRF 

1.5,R2 

7 

R2 


1.5 - 

(v/2) * x[3] * x[ 

MPYF 

R2,R1 

7 

R1 

= 

x[4] 

= x[31 



; 


* 

(1.5 

- (v/2)*x[3]*x[3] ) 

MPYF 

R1,R1,R2 

7 

R2 

= 

x[4] 

* x[41 

MPYF 

R0,R2 

7 

R2 


(v/2) 

* x[4] * x[4] 

SUBRF 

1.5,R2 

7 

R2 

= 

1.5 - 

(v/2) * x[4] * x[ 

MPYF 

R2,R1 

7 

R1 

= 

x[5] 

= x[4] 



7 


* 

(1.5 

- (v/2)*x[4]*x[4]) 

RND 

R1,R0 

7 

Round 


MPYF 

R3,R0 

7 

Sqrt(v) from sqrt(v**(-1)) 

RETS 







. end 








12.3.6 Extended-Precision Arithmetic 

The TMS320C30 offers 32 bits of precision for integer arithmetic, and 24 bits 
of precision in the mantissa for floating point arithmetic. For higher precision 
in floating-point operations, the eight extended-precision registers RO to R7 
contain eight more bits of accuracy. Since no corhparable extension is avail¬ 
able for fixed-point arithmetic, this section discusses hdw4ixed-point double 
precision can be achieved using the capabilities of the processor. The tech¬ 
nique consists of performing the arithmetic by parts, similar to the way In 
which longhand arithmetic Is done. 

The instruction set has operations AD DC (Add with Carry) and SUBB (Sub¬ 
tract with Borrow) which use the status carry bit for extended-precision 
arithmetic. The carry bit is affected by the arithmetic operations of the ALU, 
and the rotate and shift instructions. It can also be manipulated directly by 
setting the status register to certain values. For proper operation, the overflow 
mode bit should be reset (OVM = 0) so the accumulator results will not be 
loaded with the saturation values. Example 12-19 and Example 12-20 show 
64-bit addition and 64-bit subtraction. The first operand is stored in the reg¬ 
isters RO (low word) and R1 (high word). The second operand is stored In 
R2 and R3 respectively. The result is stored In RO and R1. 
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Example 12-19. 64-Bit Addition 


* TITL 64-BIT ADDITION 

* 

* TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER PRODUCING A 64-BIT 

* RESULT. THE NUMBERS X (R1,R0) AND Y (R3,R2) ARE ADDED, 

* RESULTING IN W (R1,R0). 

* 

* R1 RO 

* + R3 R2 

* _ 

* R1 RO 

* 

ADDI R2,R0 

ADDC R3,R1 


Example 12-20. 64-Bit Subtraction 

* TITL 64-BIT SUBTRACTION 

* 

* TWO 64-BIT NUMBERS ARE SUBTRACTED FROM EACH OTHER PRODUCING 

* A 64-BIT RESULT. THE NUMBERS X (R1,R0) AND Y (R3,R2) ARE 

* SUBTRACTED, RESULTING IN W (R1,R0). 

* 

* R1 RO 

* - R3 R2 

* _ 

* R1 RO 

* 

SUBI R2,R0 

SUBB R3,R1 


When two 32-bit numbers are multiplied, a 64-bit product results. The pro¬ 
cedure for multiplication is to split the 32-bit magnitude values of the multi¬ 
plicand X and the multiplier Y into two parts (X1 ,X0) and (X3,X2) respectively 
with 16 bits each. The operation is done on unsigned numbers, and the pro¬ 
duct is adjusted for the sign bit. Example 12-21 shows the Implementation 
of a 32 bit X 32 bit multiplication. 
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12 


Example 12-21. 32 by 32 Bit Multiplication 

* 

* TITL 32 X 32 BIT MULTIPLICATION 

* 

* 

* SUBROUTINE EXTMPY 

* 

* FUNCTION: TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT 

* RESULT. THE TWO NUMBERS (X and Y) ARE EACH SEPARATED INTO TWO 

* PARTS (XI XO) AND (Y1 YO), WHERE XO, XI, YO, AND Y1 ARE 16 BITS. 

* THE TOP BIT IN XI AND Y1 IS THE SIGN BIT. THE PRODUCT IS 

* IN TWO WORDS (WO AND Wl). THE MULTIPLICATION IS PERFORMED ON 

* POSITIVE NUMBERS, AND THE SIGN IS DETERMINED AT THE END. 

* 


* 


* 


* 

* 

* 


* 


* 

* 

* 


* 

* 

* 


XI XO 
X Y1 YO 


BITS OF PRODUCTS 

(NOT COUNTING SIGN) PRODUCT 


XO*YO 

X0*Y1 

Xl*YO 

X1*Y1 


16+16 PI 

16+16 P2 

16+16 P3 

16+16 P4 


Wl WO 


ARGUMENT ASSIGNMENTS 
ARGUMENT j FUNCTION 

RO I MULTIPLIER AND LOW WORD OF THE PRODUCT 

R1 I MULTIPLICAND AND UPPER WORD OF THE PRODUCT 


* REGISTERS USED AS INPUT: RO, R1 

* REGISTERS MODIFIED: RO, Rl, R2, R3, R4, ARO, ARl, 

* REGISTER CONTAINING.RESULT: R0,R1 

* 


* 


* 

* 

CYCLES: 

28 (WORST 

CASE) 

WORDS: 25 

•k 

.GLOBAL 

EXTMPY 



EXTMPY 

XOR3 

R0,R1,AR0 

/ 

Store sign 


ABSI 

RO 

r 

Absolute values of X 


ABSI 

Rl 


and Y 


* 


* SEPARATE 

* 

MULTIPLIER AND 

MULTIPLICAND 

INTO TWO PARTS 



LDI 

-16,ARl 







LSH3 

AR1,R0,R2 

; R2 = XI 

= Upper 

16 

bits 

of 

X 

AND 

OFFFFH,RO 

; RO = XO 

= Lower 

16 

bits 

of 

X 

LSH3 

AR1,R1,R3 

; R3 = Y1 

= Upper 

16 

bits 

of 

Y 

AND 

0FFFFH,R1 

; Rl = YO 

= Lower 

16 

bits 

of 

Y 
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* 

* CARRY OUT THE MULTIPLICATION 

* 


DONE 


MPYI3 

R0,R1,R4 

XO*YO = PI 

MPYI 

R3,R0 

X0*Y1 = P2 

MPYI 

R2,R1 

X1*Y0 = P3 

ADD I 

R0,R1 

P2+P3 

MPYI 

R2,R3 

X1*Y1 = P4 

LDI 

R1,R2 


LSH 

16, R2 

Lower 16 bits of P2+P3 

CMP I 

0,AR0 

Check the sign of the product 

BGED 

DONE 

If >0, multiplication complete (delayed) 

AND 

0FFFFH,R1 

Upper 16 bits of P2+P3 

ADDI3 

R4,R2,R0 

WO = RO = Lower word of the product 

ADDC3 

R1,R3,R1 

W1 = R1 = Upper word of the product 

TE THE 

PRODUCT IF THE NUMBERS ARE OF OPPOSITE SIGN 

NOT 

RO 


ADDI 

1,R0 


NOT 

R1 


ADDC 

0,R1 


RETS 



. end 




12.3.7 Floating-point Format Conversion: IEEE to/from TIV/IS320C30 

In fixed-point arithmetic, the binary point that separates the integer from the 
fractional part of the number is fixed at a certain location. For example, if it 
is chosen that a 32-bit number has the binary point after the most significant 
bit (which is also the sign bit), only fractional numbers (numbers with abso¬ 
lute values less that 1), can be represented. In this case. It is said that we have 
a Q31 number, where 31 is the number of fractional bits. All operations as¬ 
sume that the binary point is fixed at this location. 

The fixed-point system, although simple to implement in hardware, imposes 
limitations In the dynamic range of the represented number, which causes 
scaling problems In many applications. The difficulty is avoided by using 
floating-point numbers. A floating-point number consists of a mantissa m 
multiplied by base b raised to an exponent e: 

m * 


In current hardware Implementations, the mantissa is typically a normalized 
number with absolute value between 1 and 2, and the base is b=2. Although 
the mantissa is represented as a fixed-point number, the actual value of the 
overall number floats the binary point because of the multiplication by b^ The 
exponent e is an integer whose value determines the position of the binary 
point In the number. IEEE has established a standard format for the repre¬ 
sentation of floating-point numbers. 


12 


In order to achieve higher efficiency in the hardware implementation, the 
TMS320C30 uses a floating-point format that differs from the IEEE standard. 
This section describes briefly the two formats and presents software routines 
to convert between them. 
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TMS320C30 floating-point format: 


8 


23 



In a 32-blt word representing a floating-point number, the first 8 bits corre¬ 
spond to the exponent expressed in two's-complement format There is one 
bit for sign, and 23 bits for the mantissa. The mantissa is expressed in two's- 
complement form with the binary point after the most significant non-sign bit 
Since this bit is the complement of the sign bit s, it is suppressed. In other 
words, the mantissa actually has 24 bits. One special case occurs when 
e=-128. In this case, the number is interpreted as zero independent of the 
values of s and f (which are by default set to zero). To summarize, the values 
of the represented numbers in the TMS320C30 floating-point format are: 


2® * (01 .f) If s=0 

2®*(10.f) ifs=1 

0 ife=-128 


IEEE floating-point format: 


1 8 


23 



12 


The IEEE floating-point format uses sign-magnitude notation for the mantissa, 
and offset by 127 for the exponent. In a 32-blt word representing a float¬ 
ing-point number, the first bit is the sign bit. The next 8 bits correspond to 
the exponent, expressed in an offset-by-127 format (the actual exponent Is 
e-127). The following 23 bits represent the absolute value of the mantissa 
with the most significant 1 implied. The binary point Is after this most signif¬ 
icant 1. In other words, the mantissa actually has 24 bits. There are several 
special cases, summarized below. 


12-36 




Software Applications - Logical and Arithmetic Operations 


The values of the represented numbers in the IEEE floating-point format are: 


(- 1)8 * 2®-'*27 * (01 . f ) 

Special cases: 

(- 1)8 * 0.0 

(-I)S * 2-126 . (O.f) 

(-l)s * infinity 

NaN 


if 0<e<255 


if e=0 and f=0 (zero) 
if e=0 and f<>0 (denormalized) 
if e=255 and f=0 (infinity) 
if e=255 and f<>0 (Not a Number) 


Based on these definitions of the formats, two versions of the conversion 
routines were developed. One version handles the complete definition of the 
formats. The other ignores some of the special cases (typically the ones that 
are very rarely used), but it has the benefit that it executes faster than the 
complete conversion. For this discussion, they are referred to as the complete 
version and the the fast version. 

12.3.7.1 IEEE to TMS320C30 Floating-Point Format Conversion 

The fast version of the IEEE-to-TMS320C30 conversion routine was originally 
developed by Keith Henry of Apollo Computer, Inc. The other routines were 
based on this initial input. Example 12-22 shows the fast conversion from 
IEEE to TMS320C30 floating-point format. It properly handles the general 
case when 0<e<255, and zeros (i.e., e=0 and f=0). The other special cases 
(denormalized, infinity, and NaN) are not treated and, if present, will give er¬ 
roneous results. 


12 
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Example 12-22. TMS320C30 To IEEE Conversion (Fast Version) 


* TITL IEEE TO TMS320C30 CONVERSION (FAST VERSION) 

* 

■k 

* SUBROUTINE FMIEEE 

* 

* FUNCTION; CONVERSION BETWEEN THE IEEE FORMAT AND THE 

* 320C30 FLOATING POINT NUMBERS. THE NUMBER TO 

* BE CONVERTED IS IN THE LOWER 32 BITS OF RO. 

* THE RESULT IS STORED IN THE UPPER 32 BITS OF RO. 

* UPON ENTERING THE ROUTINE, ARl POINTS TO THE 

* FOLLOWING TABLE: 

* 

* (0) OxFFSOOOOO <— ARl 

* (1) OxFFOOOOOO 

* (2) OxVFOOOOOO 

* (3) 0x80000000 

* (4) 0x81000000 

* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* - + - 

* RO I NUMBER TO BE CONVERTED 

* ARl I POINTER TO TABLE WITH CONSTANTS 

* 

* REGISTERS USED AS INPUT: RO, ARl 

* REGISTERS MODIFIED: RO, R1 

* REGISTER CONTAINING RESULT: RO 

* 

* NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 

* INITIALIZE IT IN THE CALLING PROGRAM. 

* 


* 


* CYCLES; 12 (WORST CASE) WORDS: 12 

* 

.global FMIEEE 

★ 


FMIEEE AND3 
BND 
ADDI 
LDIZ 
SUBI 
PUSH 
POPF 
RETS 


R0,*AR1,R1 

NEG 

R0,R1 

*+ARl(l),R1 
*+ARl(2),R1 
R1 
RO 


; Replace fraction with 0 
; Test sign 

; Shift sign and exponent inserting 0 
; If all zero, generate C30 zero 
; Unbias exponent 

; Load this as a fit. pt. number 


NEG 


12 


PUSH R1 
POPF RO 
NEGF RO,RO 

RETS 


Load this as a fit. pt. number 
Negate if original sign negative 


Example 12-23 is the complete conversion between IEEE and TMS320C30 
formats. In addition to the general case and the zeros, it handles the special 
cases as follows: 


If NaN (e=255, f<>0), the number Is returned Intact 

If Infinity (e=255, f=0); the output is saturated to the most positive or 

negative number respectively. 
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• If denormalized (e=0, f<>0), two cases are considered. If the MSB of 

f Is 1, the number is converted to TMS320C30 format. Otherwise, an 
underflow occurs and the number is set to zero. 


12 
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Example 12-23. IEEE to TMS320C30 conversion (complete version) 


* TITL IEEE TO TMS320C30 CONVERSION (COMPLETE VERSION) 


SUBROUTINE 


FMIEEEl 


FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE 320C30 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE LOWER 32 BITS OF RO. THE RESULT IS STORED 
IN THE UPPER 32 BITS OF RO. 


UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 


* 

(0) 

OxFFSOOOOO 

<-- ARl 

* 

(1) 

OxFFOOOOOO 


* 

(2) 

OxVFOOOOOO 


* 

(3) 

0x80000000 


* 

(4) 

0x81000000 


* 

(5) 

0x7F800000 


* 

(6) 

0x00400000 


* 

(7) 

0x007FFFFF 


* 

■k 

(8) 

0x7F7FFFFF 


k 

ARGUMENT ASSIGNMENTS: 


k 

k 

ARGUMENT 

1 FUNCTION 


k 

RO 

1 NUMBER TO BE CONVERTED 

k 

k 

ARl 

1 POINTER TO 

TABLE WITH CONSTANTS 

k 

REGISTERS USED AS INPUT: 

RO, ARl 

k 

REGISTERS MODIFIED: RO, 

R1 

k 

k 

REGISTER CONTAINING RESULT: RO 

k 

NOTE: SINCE 

THE STACK POINTER SP IS USED, MAKE SURE TO INITIALIZE 

k 

k 

IT IN 

THE CALLING 

PROGRAM. 

k 

k 

k 

CYCLES: 23 

(WORST CASE) 

WORDS: 34 

k 

.global FMIEEEl 


FMIEEEl LDI 

R0,R1 



AND 

*+ARl(5),R1 



BZ 

UNNORM 

; If e=0, number is either 0 or 

* 

XOR 

*+ARl(5),R1 

; unnormalized 


BNZ 

NORMAL 

; If e<255, use regular routine 

* 

HANDLE NaN 

AND INFINITY 



TSTB 

RETSNZ 

*+ARl(7),R0 

; Return if NaN 


LDI 

R0,R0 



LDFGT 

LDFN 

*+ARl(8),R0 

; If positive, infinity= 

; most positive number 

; If negative, infinity= 


*+ARl(5),R0 


RETS 


; most negative number RETS 
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* HANDLE ZEROS AND UNNORMALIZED NUMBERS 


UNNORM 

TSTB 

*-l-ARl(6) ,R0 

; Is the msb of f equal to 1? 



LDFZ 

*+ARl(3),R0 

; If not, force the number to 

zero 


RETSZ 


; and return 



XOR 

*-l-ARl(6) ,R0 

; If (msb of f)=l, make it 0 



BND 

NEGl 




LSH 

1,R0 

; Eliminate sign bit and line 

up mantissa 


SUBI 

*-HARl(2) ,R0 

; Make e=-127 



PUSH 

RO 




POPE 

RETS 

RO 

; Put number in floating point 

format 

NEGl 

POPE 

RO 




NEGE 

RETS 

RO,RO 

; If negative, negate RO 



* HANDLE THE REGULAR CASES 

* 


NORMAL 

AND 3 

R0,*AR1,R1 

; Replace fraction with 0 



BND 

NEG 

; Test sign 



ADD I 

R0,R1 

; Shift sign and exponent 

inserting 0 


SUBI 

*+ARl(2),R1 

; Unbias exponent 



PUSH 

R1 




POPE 

RETS 

RO 

; Load this as a fit. pt. 

number 

NEG 

POPE 

RO 

; Load this as a fit. pt. 

number 


NEGE 

RETS 

RO,RO 

; Negate if original sign 

negative 


12.3.7.2 TMS320C30 to IEEE Floating-Point Format conversion 

The vast majority of the numbers represented by the TMS320C30 format are 
covered by the general IEEE format and the representation of zeros. The only 
special case to consider is when e=-127 In the TMS320C30 format. This 
corresponds to an denormalized number In IEEE format and it Is ignored in the 
fast version, while it Is treated properly in the complete version. Example 
12-24 shows the fast and Example 12-25, the complete version of the 
TMS320C30-to-IEEE conversion. 
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12 


Example 12-24. TMS320C30 to IEEE Conversion (Fast Version) 


* TITL TMS320C30 TO IEEE CONVERSION (FAST VERSION) 

* 


* SUBROUTINE TOIEEE 

* 


* 


* 

* 

* 

* 

* 

* 


* 

* 

* 


* 

* 

* 

* 

* 

* 

* 

* 


FUNCTION: CONVERSION BETWEEN THE 320C30 FORMAT AND THE IEEE 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO. THE RESULT WILL BE IN 
THE LOWER 32 BITS OF RO. 


UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 

(0) OxFFSOOOOO <-- ARl 

(1) OxFFOOOOOO 

(2) OxTFOOOOOO 

(3) 0x80000000 

(4) 0x81000000 


ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 


RO I NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 


REGISTERS USED AS INPUT: RO, ARl 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 


NOTE: SINCE THE STACK POINTER 'SP’ IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 


* 


* CYCLES: 14 (WORST CASE) 

* 


* 

TOIEEE 


NEG 


.global 

TOIEEE 

LDF 

R0,R0 

LDFZ 

*+ARl(4),R0 

BND 

NEG 

ABSF 

RO 

LSH 

1,R0 

PUSHF 

RO 

POP 

RO 

ADDI 

*+ARl(2),R0 

LSH 

-1,R0 

RETS 



POP 

RO 

ADDI 

*+ARl(2),R0 

LSH 

-1,R0 

ADDI 

*+ARl(3),R0 

RETS 



WORDS: 15 


Determine the sign of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in RO 

Place number in lower 32 bits of RO 
Add exponent bias (127) 

Add the positive sign 


Place number in lower 32 bits of RO 
Add exponent bias (127) 

Make space for the sign 
Add the negative sign 
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Example 12-25. TMS320C30 to IEEE Conversion (Complete Version) 


* TITL TMS320C30 TO IEEE CONVERSION (COMPLETE VERSION) 

* 


* 

* 

* 


* 

* 

* 

* 

* 

* 


* 

★ 

* 

* 

* 

* 

* 

★ 


* 

* 

* 


* 


★ 

* 


* 


SUBROUTINE TOIEEEl 


FUNCTION: CONVERSION BETWEEN THE 320C30 FORMAT AND THE IEEE 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO. THE RESULT WILL BE 
IN THE LOWER 32 BITS OF RO. 


UPON 

ENTERING THE ROUTINE, 

(0) 

OxFFSOOOOO <— ARl 

(1) 

OxFFOOOOOO 

(2) 

0X7F000000 

(3) 

0x80000000 

(4) 

0x81000000 

(5) 

0X7F800000 

(6) 

0x00400000 

(7) 

0x007FFFFF 

(8) 

0x7F7FFFFF 

ARGUMENT 

ASSIGNMENTS: 


ARGUMENT | FUNCTION 


ARl POINTS 


TO THE FOLLOWING TABLE: 


RO I NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 


REGISTERS USED AS INPUT: RO, ARl 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 


NOTE: SINCE THE STACK POINTER 'SP* IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 


* 

* 

* 


CYCLES: 31 (WORST CASE) WORDS: 25 


* 

TOIEEEl LDF 
LDFZ 
BND 
ABSF 
LSH 
PUSHF 
POP 
ADD I 
LSH 


TOIEEEl 

RO,RO 

*+ARl(4),R0 

NEG 

RO 

1,R0 

RO 

RO 

*+ARl(2),R0 
-1,R0 


global 


Determine the sign of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in RO 

Place number in lower 32 bits of RO 
Add exponent bias (127) 

Add the positive sign 
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CONT TSTB *+ARl(5),R0 

RETSNZ ; If E>0, return 

TSTB *+ARl(7),R0 

RETSZ ; If E=0 & F=0, return 

PUSH RO 

POPE RO 

LSH “1,R0 ; Move F right by one bit 

PUSHF RO 

POP RO 

ADDI *+ARl(6),R0 ; Add to F a msb of 1 

RETS 


NEG POP RO ; Place number in lower 32 bits of RO 

BRD CONT 

ADDI *+ARI(2),R0 ; Add exponent bias (127) 

LSH -1,R0 ; Make space for the sign 

ADDI *+ARl(3),R0 ; Add the negative sign 
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12.4 Application-Oriented Operations 

The TMS320C30 has been designed to provide efficient implementations of 
digital signal processing algorithms. The architecture and the instruction set 
of the device include features that facilitate the solution of numerically inten¬ 
sive problems. This section presents examples of applications using these 
features, such as companding, filtering, Fast Fourier Transforms (FFT), and 
matrix arithmetic. 

12.4.1 Companding 

In the area of telecommunications, one of the primary concerns is the conser¬ 
vation of the channel bandwidth, while at the same time preserving high 
speech quality. This is achieved by quantizing the speech samples logarith¬ 
mically. It has been demonstrated that an 8-bit logarithmic quantizer produces 
speech quality equivalent to a 13-blt uniform quantizer. The logarithmic 
quantization Is achieved by companding (COMpress/exPANDing). Two in¬ 
ternational standards have been established for companding: the p-law (used 
in the United States and Japan), and the A-law (used in Europe). Detailed 
descriptions of p-law and A-law companding are presented in an application 
report on companding routines included In the book "Digital Signal Process¬ 
ing Applications with the TMS320 Family". 

During transmission, logarithmically compressed data In sign-magnitude form 
are transmitted along the communications channel. If any processing is nec¬ 
essary, these data should be expanded to a 14-bit (for p-law) or 13-bit (for 
A-law) linear format. This operation is done upon receiving the data at the 
digital signal processor. After processing, and in order to continue trans¬ 
mission, the result is compressed back to 8-bit format and transmitted through 
the channel. 

Examples 12-26 and 12-27 show p-law compression and expansion (i.e., li¬ 
near to M-law and p-law to linear conversion), while examples 12-28 and 
12-29 show A-law compression and expansion. For expansion, using a 
look-up table offers an alternative approach. It trades memory space for speed 
of execution. Since the compressed data Is 8-blts long, a table with 256 en¬ 
tries can be constructed containing the expanded data. If the compressed data 
Is stored In the register ARO, the following two instructions will put the ex¬ 
panded data In register RO: 

ADDI @TABL,ARO ; @TABL = BASE ADDRESS OF TABLE 

LDI *ARO,RO ; PUT EXPANDED NUMBER IN RO 

The same look-up table approach could be used for compression, but the re¬ 
quired table length would then be 16,384 words for p-law or 8,192 words for 
A-law. If this memory size is not acceptable, the subroutines presented in 
Examples 12-26 or 12-28. should be used. 
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Example 12-26. U-Law compression 


* 

* TITL U-LAW COMPRESSION 

* 

* 

* SUBROUTINE MUCMPR 

* 

* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* - + - 

* RO I NUMBER TO BE CONVERTED 

* 


* REGISTERS USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, SP 

* REGISTER CONTAINING RESULT: RO 

* 


* NOTE: SINCE THE STACK POINTER *SP* IS USED IN THE COMPRESSION 

* ROUTINE 'MUCMPR*, MAKE SURE TO INITIALIZE IT IN THE 

* THE CALLING PROGRAM. 

* 


* 


* 

* 

* 


* 


CYCLES: 20 WORDS: 17 


.global MUCMPR 


MUCMPR LDI 
ABSI 
CMP I 
LDIGT 
ADDI 


R0,R1 
R0,R0 
1FDEH,R0 
1FDEH,R0 
33,RO 


FLOAT RO 

MPYF 0.03125,RO 

LSH 1,RO 

PUSHF RO 

POP RO 

LSH -20,RO 


12 


LDI 0,R2 

LDI R1,R1 

LDILT 80H,R2 
ADDI R2,R0 

NOT RO 

RETS 


Save sign of number 

If R0>0xlFDE, 
saturate the result 
Add bias 

Normalize: (seg+5)OWXYZx...x 

Adjust segment number by 2**(-5) 
(seg)WXYZx...x 

Treat number as integer 
Right-justify 


If number is negative, 
set sign bit 
RO = compressed number 
Reverse all bits for transmission 
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Example 12-27. U-Law Expansion 

★ 

* TITL *U-LAW EXPANSION* 


* SUBROUTINE MUXPND 

* 

★ 

* ARGUMENT ASSIGNMENTS; 

* 

* ARGUMENT | FUNCTION 

* RO I NUMBER TO' BE CONVERTED 

* 

* REGISTERS USED AS INPUT; RO 

* REGISTERS MODIFIED; RO, Rl, R2, SP 

* REGISTER CONTAINING RESULT; RO 

* 

* 

* CYCLES; 20 (WORST CASE) WORDS; 14 

* 

* 

.global MUXPND 
MUXPND NOT RO,RO 

LDI R0,R0 

AND 0FH,R1 

LSH 1,R1 

ADDI 33,Rl 

LDI R0,R2 

LSH “4,R0 

AND 7,R0 

LSH3 R0,R1,R0 

SUBI 33,RO 

TSTB 80H,R2 

RETSZ 
NEGI RO 

RETS 


12 


; Complement bits 

; Isolate quantization bin 

; Add bias to introduce Ixxxxl 
; Store for sign bit 

; Isolate segment code 
; Shift and put result in RO 
; Subtract bias 
; Test sign bit 

; Negate if a negative number 
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12 


Example 12-28. A-Law Compression 


* TITL A-LAW COMPRESSION 


* SUBROUTINE ACMPR 

* 


* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* - + - 

* RO I NUMBER TO BE CONVERTED 

ie 

* REGISTERS USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, SP 

* REGISTER CONTAINING RESULT: RO 

* 

* NOTE: SINCE THE STACK POINTER 'SP' IS USED IN THE COMPRESSION 


ROUTINE 'ACMPR', MAKE 
CALLING PROGRAM. 


* CYCLES:22 

WORDS: 1' 

* 


.globl 

ACMPR 

* 

ACMPR 

LDI 

R0,R1 


ABSI 

RO,RO 


CMP I 

1FH,R0 


BLED 

END 


CMP I 

OFFFH,RO 


LDIGT 

OFFFH,RO 


LSH 

-1,R0 


FLOAT 

RO 


MPYF 

0.125,R0 


LSH 

1,R0 


PUSHF 

RO 


POP 

RO 


LSH 

-20,RO 

END 

LDI 

0,R2 


LDI 

R1,R1 


LDILT 

80H,R2 


ADD I 

R2,R0 


XOR 

RETS 

0D5H,R0 

* 


SURE TO INITIALIZE IT IN THE 


Save sign of number 

If R0<0x20, 

Do linear coding 
If RO>OxFFF, 

saturate the result 
Eliminate rightmost bit 

Normalize: (seg+3)OWXYZx...x 
Adjust segment number by 2**(-3) 
(seg)WXYZx...x 

Treat number as integer 
Right-justify 


If number is negative, 
set sign bit 
RO = compressed number 
Invert even bits for transmission 
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Example 12-29. A-Law Expansion 

* 

* TITL A-LAW EXPANSION 


* SUBROUTINE AXPND 


* 

* 

* 

* 

* 

* 


ARGUMENT ASSIGNMENTS: 

ARGUMENT | FUNCTION 

- +- 

RO I NUMBER TO BE CONVERTED 


* REGISTERS USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, SP 

* REGISTER CONTAINING RESULT: RO 


* 

* 

* 

* 

* 

AXPND 


SKIPl 


CYCLES: 25 (WORST CASE) WORDS: 16 


.global 

AXPND 


XOR 

D5H,R0 

} Invert even bits 

LDI 

R0,R1 


AND 

0FH,R1 

; Isolate quantization bin 

LSH 

1,R1 


LDI 

R0,R2 

; Store for bit sign 

LSH 

-4,R0 


AND 

7,R0 

; Isolate segment code 

BZ 

SKIPl 


SUBI 

1,R0 


ADD I 

32,Rl 

; Create Ixxxxl 

ADDI 

1,R1 

; OR Oxxxxl 

LSH3 

R0,R1,R0 

; Shift and put result in RO 

TSTB 

80H,R2 

; Test sign bit 

RETSZ 



NEGI 

RO 

; Negate if a negative number 

RETS 




12.4.2 FIR, HR, and Adaptive Filters 

Digital filters are a common requirement for digital signal processing systems. 
There are two types of digital filters. Finite Impulse Response (FIR) and Infi¬ 
nite Impulse Response (MR). Each of these types can have either fixed or 
adaptable coefficients. In this section, the fixed-coefficient filters are pre¬ 
sented first, and then the adaptive filters are discussed. 
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12,4.2.1 FIR Filters 

If the FIR filter has an impulse response /7[0], /?[!],..., /7[N-1], and x[n'\ re¬ 
presents the input of the filter at time n, the output y[n] at time n is given by 
the equation: 

/[a7] =/7[0] x[n] + h[^] X [n-^] + ...+ h[N-^] x[a7-(N-1)] 

Two features of the TMS320C30 that facilitate the implementation of the FIR 
filters, are parallel multiply/add operations and the circular addressing. The 
first one permits the performance of a multiplication and an addition in a single 
machine cycle, while the second one makes a finite buffer of length N suffi¬ 
cient for the data x. 

Figure 12-1 shows the arrangement of the memory locations in order to im¬ 
plement the circular addressing, while Example 12-30 presents the 
TMS320C30 assembly code for an FIR filter. 



impulse 


initial 

final 

low 

response 


input samples 

input samples 

address 

h(N~1) 


x[n-(N-1)l 


xin) 


h{N-2) 

I oldest input 

x[n-(N-2)] 


x[n-(N-1)l 


• 


• 

• 


• 


• 

• 


• 


• 

• 


h(1) 


x(n-1) 


x(n-2) 

high 

h(0) 

I newest input 

x(n) 


x(n-1) 

address 





circular 

queue 


Figure 12-1. Data Memory Organization For a FIR Filter 
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In order to set up the circular addressing, the block-size register BK should 
be initialized to block length N. Also, the locations for signal x should start 
from a memory location whose address is a multiple of the smallest power of 
2 that is greater than or equal to N. For instance, if N=24, the first address for 
X should be a multiple of 32 (the lower 5 bits of the beginning address should 
be zero). To understand this requirement, look at the section describing cir¬ 
cular addressing. 
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In Example 12-30, the pointer to the input sequence x, is incremented and 
assumed to be moving from an older input to a newer input. At the end of the 
subroutine, AR1 will be pointing to the position for the next input sample. 

Example 12-30. FIR Filter 


* TITL FIR FILTER 

* 


* SUBROUTINE FIR 

* 

* EQUATION: y(n) = h(0) * x(n) + h(l) * x(n-l) + 

* ... + h(N-l) * x(n-(N-l)) 

* 

* TYPICAL CALLING SEQUENCE; 

* 


* 

LOAD 

ARO 

* 

LOAD 

ARl 

* 

LOAD 

RC 

* 

LOAD 

BK 

* 

CALL 

FIR 


★ 


* 


* ARGUMENT ASSIGNMENTS; 

* ARGUMENT I FUNCTION 

* — — — 

* ARO I ADDRESS OF h(N-l) 

* ARl I ADDRESS OF x(N-l) 

* RC I LENGTH OF FILTER - 2 (N“2) 

* BK I LENGTH OF FILTER (N) 


* REGISTERS USED AS INPUT: ARO, ARl, RC, BK 

* REGISTERS MODIFIED; RO, R2, ARO, ARl, RC 

* REGISTER CONTAINING RESULT: RO 


* CYCLES: 11 + (N-1) 


WORDS; 


.global FIR 

* ; Initialize RO: 

FIR MPYF3 *AR0++(1),*AR1++(1)%,R0 

* ; h(N-l) * x(n-(N-l)) -> RO 

LDF 0.0,R2 ; Initialize R2. 

* 

* FILTER ( 1 <= i < N) 

* 

RPTS RC ; Setup the repeat cycleE. 

MPYF3 *AR0++(1),*AR1++(1)%,R0 ; h(N-l-i)*x(n-(N-l-i))->R0 

I I ADDF3 R0,R2,R2 ; Multiply and add operation 

* 

ADDF R0,R2,R0 ; Add last product 


RETURN SEQUENCE 


RETS 


Return 


* end 

* 

. end 
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12A.2.2 UR Filters 

The transfer function of the HR filters has both poles and zeros. Its output 
depends on both the Input and the past output. As a rule, they need less 
computation than an FIR with similar frequency response, but they have the 
drawback of being sensitive to coefficient quantization. Most often, the HR 
filters are implemented as a cascade of second-order sections, called biquads. 
Examples 12-31 and 12-32 show the implementation for one biquad and for 
any number of biquads respectively. 

The equation for a single biquad Is given by: 

y{n] = a^ y[n-^] + a2 y[n-2] + bO x[n] + b^ x[n-^ ]-f b2 x[n~2] 

This equation can be Implemented more conveniently by the following two 
equations which have less storage requirements: 

d[n-\ = a2 d[n-2'\+ a1 c/[r7-1]+ x[/?] 
yin] = b2 d[n-2]^ / 7 I d[n-^ + bO d[n] 

Figure 12-2 shows the memory organization for this approach, and Example 
12-31 is an implementation of a single biquad on the TMS320C30. 


low 

address 


high 

address 


filter 


initial delay 

final delay 

coefficients 


node values 

node values 

a2 

newest delay 

d(n) 


d(n-1) 

b2 


d(n-1) 


d(n-2) 

a1 

oldest delay 

d(n-2) 


din) 

b1 




bO 





circular queue 


Figure 12-2. Data Memory Organization For a Single Biquad 
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As In the case of FIR filters, the address for the start of the values d must be 
a multiple of 4, i.e., the last two bits of the beginning address must be zero. 
The block-size register BK must be initialized to 3. 
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Example 12-31. UR Filter (One Biquad) 


* TITL HR filter 

* 

* 

* SUBROUTINE I I R 1 

* 

* IIRl == HR FILTER (ONE BIQUAD) 

* 


* EQUATIONS: d(n) = a2 * d(n-2) + al * d(n-l) + x(n) 

* y(n) = b2 * d(n-2) + bl * d(n-l) + bO * d(n) 

* 

* OR y(n) = al*y(n“l) + a2*y(n-2) + bO*x(n) 

* + bl*x(n“l) + b2*x(n“2) 

* 

* TYPICAL CALLING SEQUENCE: 

* 


* 

load 

R2 

* 

load 

ARO 

* 

load 

ARl 

* 

load 

BK 

* 

CALL 

IIRl 


★ 


* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* — — — — — — ~ — — 

* R2 I INPUT SAMPLE X(N) 

* ARO I ADDRESS OF FILTER COEFFICIENTS (A2) 

* ARl I ADDRESS OF DELAY MODE VALUES (D(N-2)) 

* BK I BK = 3 

* 

* REGISTERS USED AS INPUT: R2, ARO, ARl, BK 

* REGISTERS MODIFIED: RO, Rl, R2, ARO, ARl 

* REGISTER CONTAINING RESULT: RO 

* 

* CYCLES: 11 WORDS: 8 

* 


* FILTER 

★ 

.global IIRl 

* 


IIRl 

* 


★ 


* 


MPYF3 *AR0,*AR1,R0 

; a2 * d(n-2) -> RO 
MPYF3 *++ARO(l),*AR1—(1)%,R1 

; b2 * d(n-2) -> Rl 

MPYF3 *++ARO(1),*AR1,R0 ; al * d(n-l) -> RO 

ADDF3 R0,R2,R2 ; a2*d(n-2)+x(n) -> R2 

MPYF3 *++ARO(l),*AR1—(1)%,R0 ; bl * d{n-l) -> RO 

ADDF3 R0,R2,R2 ; al*d(n-1)+a2*d(n-2)+x(n) -> R2 
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* 

MPYF3 *++ARO(l),R2.R2 ; bO * d(n) -> R2 


11 

STF 

R2,*AR1++(1)% 

•k 



* 

ADDF 

R0,R2 


ADDF 

R1,R2,R0 


* 


* RETURN SEQUENCE 

* 

RETS 

★ 

* end 

* 

. end 


Store d(n) and point to d(n-l). 

bl*d(n-l)+bO*d(n) -> R2 
b2*d(n-2)+bl*d(n-l)+b0*d(n) -> RO 

Return 


In the more general case, the MR filter will contain N>1 biquads. The 
equations for its implementation are given by the following pseudo-C lan¬ 
guage code: 

y[0,n] = x[n\ 
for (/=0; /<N; /++){ 

(/[4/j] = a2[/] d[i, /j-2] + a1 [/] r/[ /,/7-1] + /[ /-I,/?] 

y[4n] = 62[/] or[/;/7-2]+ b^ [/] (/[4/7-1]+ /»0[/] dii.nl 

} 

y[n] = k[N-1,/7] 

The corresponding memory organization is shown in Figure 12-3, while Ex¬ 
ample 12-32 is the TMS320C30 assembly-language code. 


12 


12-54 



Software Applications - Application-Oriented Operations 


low 

address 


filter 

coefficients 


initial delay 
node values 


final delay 
node values 


a2(0) 

newest delay 

d(0, n) 


d(0, n-1) 

b2(0) 


d(0, n-1) 


d(0, n-2) 

a1(0) 

oldest delay 

d(0. n-2) 


d(0, n) 

b1|0) 


empty 


empty 

b0(0) 


• 

• 


circular queue 


high 

address 


a2(N-1) 


b2(N-1) 


a1(N~1) 


b1(N-1) 


bO(N-l) 


d(N-1, n) 


d(N-1, n-1) 

d(N-1, n-1) 


d(N-1. n-2) 

d(N-1. n-2) 


d(N-1, n) 

empty 


empty 


circular queue 


Figure 12-3. Data Memory Organization For N Biquads 


The block register BK should be initialized to 3, and the beginning of each set 
of d values (i.e., /=0...N-1) should be at an address that is a multiple 

of 4 (the last two bits zero), as stated in the case of a single biquad. 
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Example 12-32. HR Filters (N > 1 Biquads) 


* 

* 

* 

* 


TITL HR FILTERS (N > 1 BIQUADS) 


* SUBROUTINE IIR2 

* 

* 


* 


* 

* 

EQUATIONS: 

y(0,n) = x(n) 

* 

* 

FOR (i = 0; 
{ 

i < N; i++) 

* 

d(i,n) = 

a2(i) * d(i,n-2) 

* 

* 

y(i/n) = 
} 

b2(i) * d(i,n-2) 

* 

* 

y(n) = y(N~ 

l,n) 

* 

it 

TYPICAL CALLING SEQUENCE: 

* 

load 

R2 

* 

load 

ARO 

* 

load 

ARl 

* 

load 

IRO 

* 

load 

IRl 

* 

load 

BK 

* 

load 

RC 

* 

CALL 

IIR2 


* 


+ al(i) 
+ bl(i) 


* d(i,n-l) + y(i-l, 

* d(i,n-l) + bO(i) 


* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* - + - 


* 

R2 

1 INPUT SAMPLE x(n) 


* 

ARO 

1 ADDRESS OF FILTER 

COEFFICIENTS (a2(0)) 

* 

ARl 

1 ADDRESS OF DELAY NODE VALUES (d(0,n-2)) 

* 

BK 

1 BK = 3 


* 

IRO 

1 IRO = 4 


* 

IRl 

1 IRl = 4*N-4 


* 

RC 

1 NUMBER OF BIQUADS 

(N) - 2 


* 


* REGISTERS USED AS INPUT: R2, ARO, ARl, IRO, IRl, BK, RC 

* REGISTERS MODIFIED: RO, Rl, R2, ARO, ARl, RC 

* REGISTER CONTAINING RESULT: RO 

* 

* CYCLES: 23 + 6(N-1) WORDS: 17 

•k 

it 

it 

* 

.global IIR2 

* 

IIR2 MPYF3 *ARO, *AR1, RO 

* ; a2(0) * d(0,n-2) -> RO 
MPYF3 *++ARO(l), *AR1—(1)%, R1 

* ; b2(0) * d(0,n“2) -> R1 
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I I 


MPYF3 

ADDF3 

MPYF3 

ADDF3 

MPYF3 

STF 


*++ARO(l),*AR1,R0 ; al(0) * D(0,n-1) -> RO 

RO, R2, R2 ; First sum term of d(0,n). 

*++ARO(l),*AR1—(1)%,R0 ;bl(0) * d(0,n-l) -> RO 
RO, R2, R2 ; Second sum term of d(0,n). 
*++ARO(1),R2,R2 ;b0(0) * d(0,n) -> R2 
R2, *AR1--(1)% 

; Store d(0,n); Point to d(0,n-2) 


I I 
* 

I I 


LOOP 


RPTB 

MPYF3 

ADDF3 

MPYF3 

ADDF3 

MPYF3 

ADDF3 

MPYF3 

ADDF3 

STF 

MPYF3 


LOOP 


Loop for 1 <= i < n 


*++ARO(l),*++ARl(IRO),R0 ;a2{i) * d(i,n-2) -> RO 
R0,R2,R2 ; First sum term of y(i-l,n). 

*++ARO(l),*AR1—(1)%,R1 ;b2(i) * D(i,n-2) ~> R1 
R1,R2,R2 ; Second sum term of y(i-l,n). 

*++ARO(l),*AR1,R0 ;al(i) * d(i,n-l) -> RO 
RO, R2, R2 ; First sum of d(i,n). 

*++ARO(l),*AR1—(1)%,R0 ;bl(i) * d(i,n-l) -> RO 
RO, R2, R2 ; Second sum term of d(i,n). 

R2, *AR1—(1)% 

; Store d(i,n); point to d{i,n-2) 
*++ARO(l), R2, R2 

; bO(i) * d(i,n) -> R2 


FINAL SUMMATION 


ADDF 

ADDF3 

NOP 

NOP 


R0,R2 

R1,R2,R0 

*AR1--(IR1) 
*AR1—(1)% 


First sum term of y(n-l,n) 
Second sum term of y(n“l,n) 

Return to first biquad 
Point to d(0,n-l) 


RETURN SEQUENCE 
RETS 

end 

. end 


Return 


12.4.2.3 Adaptive Filters (LMS Algorithm) 


There are applications in digital signal processing where a filter must be 
adapted over time to keep track of changing conditions. The book "Theory 
and Design of Adaptive Filters" by Treichler, Johnson, and Larimore (Wiley- 
Interscience, 1987) presents the theory of adaptive filters. Although In theory 
both FIR and MR structures can be used as adaptive filters, the stability prob¬ 
lems and the local optimum points that the MR filters exhibit, make them less 
attractive for such an application. Hence, until further research makes MR fil¬ 
ters a better choice, only the FIR filter are used in adaptive algorithms of 
practical applications. 


12 


In an adaptive FIR filter, the filtering equation takes the form; 

y[n] = /7[/7,0] x[/7]+ /7[/7,1] X[A7-1 ]+...+ /7[/7,N-1 ] x[A7 -(N-1)] 
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The filter coefficients are time-dependent. In a least-mean-squares (LMS) 
algorithm, the coefficients are updated by a formula of the form: 

/7[/j+1,/] =/?[a/] + 0 x[/7-/], /=0,1,...,N-1 

3 is a constant for the computation. The updating of the filter coefficients can 
be interleaved with the computation of the filter output so that It takes 3 cycles 
per filter tap to do both. The updated coefficients are written over the old 
filter coefficients. Example 12-33 shows the Implementation of an adaptive 
FIR filter on the TMS320C30. The memory organization and the positioning 
of the data in memory should follow the same rules as the above FIR filter with 
fixed coefficients. 
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Example 12-33. Adaptive FIR Filter (LMS Algorithm) 

* TITL ADAPTIVE FIR FILTER (LMS ALGORITHM) 


★ 

* SUBROUTINE LMS 


* LMS == LMS ADAPTIVE FILTER 

★ 


* 

* 

* EQUATIONS: y(n) = h(n,0)*x(n) + h(n,1)*x(n-l) + ... 

* + h(n,N-l)*x(n-(N-l)) 

* FOR (i = 0; i < N; i++) 

* h(n+l,i)=h(n,i)+ tmuerr * xCn-i) 

* 

* TYPICAL CALLING SEQUENCE: 

* 

* load R4 

* load ARO 

* load ARl 

* load RC 

* load BK 

* CALL FIR 

* 
ic 


* 


* 

* 

* 

Sc 

Sc 

* 


ARGUMENT ASSIGNMENTS: 


ARGUMENT 

FUNCTION 


R4 

SCALE FACTOR (2 * mu 

* err) 

ARO 

ADDRESS OF h(n,N-l) 


ARl 

ADDRESS OF x(n“(N-l)) 


RC 

LENGTH OF FILTER - 2 

(N-2) 

BK 

LENGTH OF FILTER (N) 



* REGISTERS USED AS INPUT: R4, ARO, ARl, RC, BK 

* REGISTERS MODIFIED: RO, Rl, R2, ARO, ARl, RC 

* REGISTER CONTAINING RESULT; RO 

* 

* PROGRAM SIZE; 10 words 

* 

* EXECUTION CYCLES: 12 + 3(N-1) 

* 


* SETUP (i = 0) 



.global LMS 




* 



} 

Initialize 

RO: 

LMS 

MPYF3 

*AR0, *AR1, 

RO 



* 



/ 

h(n,N-l) * 

x(n-(N~l)) -> RO 


LDF 

CM 

O 

O 

7 

Initialize 

R2. 

* 



/ 

Initialize 

Rl: 


MPYF3 

*AR1++{1)%, 

R4, 

Rl 


* 



} 

x(n-(N-l)) 

* tmuerr -> Rl 


ADDF3 

*AR0++(1) , 

Rl, 

Rl 


* 



7 

h(n,N-l) + 

x(n-(N-l)) * 


tmuerr -> Rl 


12 
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* FILTER AND UPDATE ( 1 <= I < N) 


* 

* 

RPTB 

LOOP ; 

Setup the repeat block. 

* 


/ 

Filter: 


MPYF3 

*AR0—(1) ,AR1,: 

RO ; h(n,N-l“i) * x(n-(N-l-i)) -> RO 

11 
* 

ADDF3 

R0,R2,R2 ; 

Multiply and add operation. 

* 


/ 

UPDATE: 


MPYF3 

*AR1++(1)%,R4 

,R1 ;x(n,N-(N-l-i)) * tmuerr -> R1 

11 

STF 

R1,*AR0++(1) ; 

R1 -> h(n+l,N-l-(i“l)) 

LOOP 

ADDF3 

*AR0++(1), Rl, 

R1 

* 

it 


0 

h(n,N-l--i) + x(n“ (N-l-i) ) *tmuerr -> R1 


ADDF3 

RO,R2,RO ; 

Add last product. 

* 

STF 

R1,*-AR0(1) ; 

h(n,0) + x(n) * tmuerr -> h(n+l,0) 


* RETURN SEQUENCE 

* 


RETS ; Return 

* 

* end 

* 

. end 


12.4.3 Matrix-Vector Multiplication 

In matrix-vector multiplication, a K x N matrix of elements m(IJ) having K 
rows and N columns is multiplied by an N x 1 vector to produce a K x 1 result. 
The multiplier vector has elements v(j), and the product vector has elements 
p(i). Each one of the product-vector elements is computed by the expression: 

p(i)= m(i,0) v(0)+ A 77(/,1 ) v(1) + ...+/w(4N-1) v{N-l) / = 0,1 

This is essentially a dot product, and the matrix-vector multiplication contains 
as a special case the dot product presented In Example 12-2. In pseudo-C 
format, the computation of the matrix multiplication is expressed by: 
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for (/=0;/<K;/+ + ) { 

P(/) = 0 

for (/=0; /<N;j + + ) 

p{i) = p(i) + m{i,jy v(J) 

} 

Figure 12-4 shows the data memory organization for matrix-vector multipli¬ 
cation, and Example 12-34 is the TMS320C30 assembly code to implement 
it. Note that in Example 12-34, K (number of rows) should be greater than 0 
and N (number of columns) should be greater than 1. 
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Figure 12-4 
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Example 12-34. Matrix times a vector muitipiication 


* 

* TITL MATRIX TIMES A VECTOR MULTIPLICATION 

* 

* 

* SUBROUTINE MAT 

* 

* MAT == MATRIX TIMES A VECTOR OPERATION 

* 

* 

* TYPICAL CALLING SEQUENCE: 

* 


* 

load 

ARO 

* 

load 

ARl 

* 

load 

AR2 

* 

load 

AR3 

* 

load 

R1 

* 

CALL 

MAT 


* 


* 

* ARGUMENT ASSIGNMENTS: 

* argument | FUNCTION 


* 



— 



* 

ARO 

1 ADDRESS OF M(0,0) 




* 

ARl 

1 ADDRESS OF V(0) 




* 

AR2 

1 ADDRESS OF P(0) 




* 

AR3 

j NUMBER OF ROWS - 1 

(K- 

-1) 


* 

•k 

R1 

j NUMBER OF COLUMNS - 

2 

(N- 

2) 

* 

REGISTERS 

USED AS INPUT: ARO, ARl, 

AR2 

, AR3, R1 

* 

REGISTERS 

MODIFIED: RO, R2, ARO, 

ARl, 

AR2, AR3, 

* 

* 


RC, RSA, REA 




* 

* 

* 

PROGRAM SIZE: 11 




* 

EXECUTION 

CYCLES; 6 + 10 * K + K 

* 

(N 

" 1) 


* 


* 


* 

.global MAT 

* 

* SETUP 

* 


MAT 

* 


LDI R1,IR0 

ADDI 2,IRQ 


number of columns-2 -> IRQ 
IRO = N 


* FOR 

* 

ROWS 


12 


* 

* 

* FOR 

* 


* 


* 


(i = 0; 

LDF 

MPYF3 


(j = 1; 

RPTS 

MPYF3 

ADDF3 


i < K; i++) LOOP OVER THE ROWS. 

0.0,R2 ; initialize R2 

*AR0++(1),*AR1++(1),RO 

; m(i,0) * v(0) -> RO 

j < N; j++) DO DOT PRODUCT OVER COLUMNS 

R1 ; multiply a row by a column. 

*AR0++(1),*AR1++(1),R0 ;m(i,j) * v(j) -> RO 
R0,R2,R2 ; m(i,j-l) * v(j-l) + R2 -> R2. 
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DBD 

AR3,ROWS ; 

counts 

the number of rows left 

ADDF 

R0,R2 ; 

last accumulate. 

STF 

R2,*AR2+-h(l) ; 

result 

-> p(i) 

NOP 

*—ARl(IRO) 

set ARl 

to point to v(0). 


* 11! DELAYED BRANCH HAPPENS HERE 111 

* 

* RETURN SEQUENCE 


RETS ; return 

* 

* end 

* 

. end 


12.4.4 Fast Fourier Transforms (FFT) 

Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts in¬ 
formation back to the time domain from the frequency domain. Implementa¬ 
tion of Fourier transforms that are computationally efficient are known as Fast 
Fourier Transforms (FFTs). The theory of FFTs can by found in books such 
as "DFT/FFT and Convolution Algorithms" by C.S. Burrus and T.W. Parks 
(John Wiley, 1985), and in the book "Digital Signal Processing Applications 
with the TMS320 Family". 


The TMS320C30 has many features that permit very efficient Implementation 
of numerically intensive algorithms. Some of these features are particularly 
well suited for FFTs. The high speed of the device (60 ns cycle-time) makes 
easier the implementation of real-time algorithms, while the floating-point 
capability eliminates the problems associated with dynamic range. The pow¬ 
erful indexing scheme in Indirect addressing facilitates the access of FFT but¬ 
terfly legs that have different spans. A construct that reduces the looping 
overhead in algorithms heavily dependent on loops (such as the FFTs), is the 
repeat block implemented by the RPTB Instruction. This construct gives the 
efficiency of In-line coding, but has the form of a loop. Since the output of 
the FFT is in scrambled (bit-reversed) order when the Input Is in regular order, 
there Is a need to restore It In the proper order. In the TMS320C30, there is 
no need to spend extra cycles for this rearrangement. The device has a special 
form of indirect addressing (bit-reversed addressing mode), that can be used 
when the FFT output Is needed. This mode permits accessing the FFT output 
In the proper order. 

Fast Fourier Transform is a label for a collection of algorithms Implementing 
efficient conversion from time to frequency domain. Different types of FFT 
are: 


12 


• Radlx-2 and radix-4 algorithms(dependlng on the size of the FFT but¬ 
terfly) 

• Decimation In time or frequency (DIT or DIF) 

• Complex or real FFTs 

• FFTs of different lengths, etc. 
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The present implementation of the FFT was based on programs contained in 
the book "DFT/FFT and Convolution Algorithms" by C.S. Burrus and T.W. 
Parks, and in the paper "Real-Valued Fast Fourier Transform Algorithms" by 
H.V. Sorensen et al. (IEEE Trans, on ASSP, June 1987). 

Examples 12-35 and 12-36 show the implementation of a complex radlx-2, 
DIF, FFT on the TMS320C30. Example 12-35 contains the generic code of 
the FFT that can be used with any length number. However, for the complete 
implementation of an FFT, a table of twiddle factors (sines/cosines) is needed, 
and this table depends on the size of the transform. To retain the generic form 
of Example 12-35, the table with the twiddle factors (containing 1 1/4 com¬ 
plete cycles of a sine) is presented separately in Example 12-36 for the case 
of a 64-point FFT. A full cycle of a sine should have a number of points equal 
to the FFT size. In Example 12-36, the FFT length N and M, which is equal 
to the logarithm of N to base equal to the radix, are defined. M is the number 
of stages of the FFT. For a 64-point FFT, M=6 when using a radlx-2 algo¬ 
rithm or M=3 when using a radix-4 algorithm. If the table with the twiddle 
factors and the FFT code are kept in separate files, they should be connected 
at link time. 


12 
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Example 12-35. Complex, Radix-2, DIF FFT 

* 

* TITL COMPLEX, RADIX-2, DIF FFT 

* 

* GENERIC PROGRAM FOR A LOOPED-CODE RADIX-2 FFT COMPUTATION IN 320C30 

* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 111. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION 

* IS DONE IN-PLACE, BUT THE RESULT IS MOVED TO ANOTHER MEMORY 

* SECTION TO DEMONSTRATE THE BIT-REVERSED ADDRESSING. 

* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION. 

* THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 

* NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE FFT 

* N AND LOG2(N) ARE DEFINED IN A .GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 

* 

* 

.globl FFT ; Entry point for execution 

.globl N ; FFT size 

.globl M ; LOG2(N) 

.globl SINE ; Address of sine table 

INP .usect "IN",1024 ; Memory with input data 

.BSS OUTP,1024 ; Memory with output data 

. text 

* INITIALIZE 


FFTSIZ 

.word 

N 


LOGFFT 

.word 

M 


SINTAB 

.word 

SINE 


INPUT 

.word 

INP 


OUTPUT 

.word 

OUTP 


FFT: 

LDP 

FFTSIZ 

Command to load data page pointer 


LDI 

@FFTSIZ,IR1 



LSH 

-2,IR1 

IRl=N/4, pointer for SIN/COS table 


LDI 

0,AR6 

ARB holds the current stage number 


LDI 

@FFTSIZ,IR0 



LSH 

1,IR0 

IR0=2*N1 (because of real/imag) 


LDI 

@FFTSIZ,R7 

R7=N2 


LDI 

1,AR7 

Initialize repeat counter of first 


LDI 

1,AR5 

Initialize IE index (AR5=IE) 

* OUTER LOOP 



LOOP: 

NOP 

*++AR6{l) 

Current FFT stage 


LDI 

@INPUT,ARO 

ARO points to X(I) 


ADD I 

R7,AR0,AR2 

AR2 points to X(L) 


LDI 

AR7,RC 



SUBI 

1,RC i 

; RC should be one less than desired 
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12 


* FIRST LOOP 




RPTB 

BLKl 




ADDF 

*AR0,*AR2,R0 

; RO=X(I)+X(L) 



SUBF 

*AR2+-l-,*AR0H-+, 

R-L R1=X(I)“X(L) 



ADDF 

*AR2,*AR0,R2 

; R2=Y(I)+Y(L) 



SUBF 

*AR2,*AR0,R3 

; R3=Y(I)-Y(L) 



STF 

R2, SAPO¬ 

; Y(I)=R2 and... 

11 


STF 

RS, *AR2— 

; Y(L)=R3 

BLKl 

STF 

RO,*ARO++(IRO) 

; X(I)=R0 and... 

11 


STF 

R1,*AR2++(IR0) 

; X(L)=R1 and ARO,2 = ARO,2 + 2*n 

* 

IF 

THIS IS 

THE LAST STAGE, 

YOU ARE DONE 



CMP I 

@LOGFFT,AR6 




BZD 

END 


* 

MAIN INNER 

LOOP 




LDI 

2,AR1 

; Init loop counter for inner loop 



LDI 

@SINTAB,AR4 

; Initialize lA index (AR4=IA) 

INLOP 

ADD I 

AR5,AR4 

; IA=IA+IE; AR4 points to cosine 



LDI 

AR1,AR0 




ADD I 

2,AR1 

; Increment inner loop counter 



ADD I 

@INPUT,ARO 

; (X(I),Y(I)) pointer 



ADD I 

R7,AR0,AR2 

; (X(L),Y(L)) pointer 



LDI 

AR7,RC 




SUBI 

1,RC 

; RC should be one less than desired # 



LDF 

*AR4,R6 

; R6=SIN 

* 

SECOND LOOP 




RPTB 

BLK2 




SUBF 

*AR2,*AR0,R2 

; R2=X(I)~X(L) 



SUBF 

*+AR2,*+AR0,Rl 


* 




? R1=Y(I)-Y(L) 



MPYF 

R2,R6,R0 

;R0=R2*SIN and... 

I I 


ADDF 

*+AR2,*+AR0,R3 


* 




; R3=Y(I)+Y(L) 



MPYF 

R1,*+AR4(IR1), 

R3 ;R3 = R1 * COS and ... 

I I 


STF 

R3,*+AR0 

; Y(I)=Y(I)+Y(L) 



SUBF 

R0,R3,R4 

; R4=R1*C0S-R2*SIN 



MPYF 

R1,R6,R0 

; R0=R1*SIN and... 

I I 


ADDF 

*AR2,*AR0,R3 

; R3=X(I)+X(L) 



MPYF 

R2,*+AR4(IR1), 

R3 ; R3 = R2 * COS and... 

I I 


STF 

R3,*AR0++(IR0) 


* 




; X(I)=X(I)+X{L) and AR0=AR0+2*N1 



ADDF 

R0,R3,R5 

; R5=R2*C0S+R1*SIN 

BLK2 

STF 

R5,*AR2++(IR0) 

; X(L)=R2*C0S+R1*SIN, incr AR2 and... 

11 


STF 

R4,*+AR2 

; Y(L)=Rl*COS-R2*SIN 



CMP I 

R7,AR1 




BNE 

INLOP 

; Loop back to the inner loop 



LSH 

1,AR7 

; Increment loop counter for next time 
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BRDP 

LOOPS 

Next FFT stage (delayed) 


LSH 

1,AR5 

IE=2*IE 


LDI 

R7,IR0 

N1=N2 


LSH 

-1,R7 

N2=N2/2 

* STORE RESULT 

OUT USING BIT- 

-REVERSED ADDRESSING 

END: 

LDI 

@FFTSIZ,RC 

RC=N 


SUBI 

1,RC 

RC should be one less than desired 


LDI 

@FFTSIZ,IRO 

IRO=size of FFT=N 


LDI 

2,IR1 



LDI 

@INPUT,ARO 



LDI 

@0UTPUT,AR1 



RPTB 

BITRV 



LDF 

*+ARO(1),R0 


I I 

LDF 

*AR0++(IR0)B,R1 

BITRV 

STF 

R0,*+AR1(1) 


I I 

STF 

R1,*ARH-+(IR1) 


SELF 

BR 

SELF 

Branch to itself at the end 


. end 
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Example 12-36. Table With Twiddle Factors For A 64-Point FFT 

* 

*TITL TABLE WITH TWIDDLE FACTORS FOR A 64-POINT FFT 

•k 

* FILE TO BE LINKED WITH THE SOURCE CODE FOR A 64-POINT, RADIX-2 FFT. 

* 

.globl 

SINE 

.globl 

N 

.globl 

M 

N .set 

64 

M . set 

6 

. data 


SINE 


.float 

0.000000 

.float 

0.098017 

.float 

0.195090 

.float 

0.290285 

.float 

0.382683 

.float 

0.471397 

.float 

0.555570 

.float 

0.634393 

.float 

0.707107 

.float 

0.773010 

.float 

0.831470 

.float 

0.881921 

.float 

0.923880 

.float 

0.956940 

.float 

0.980785 

.float 

0.995185 

COSINE 


.float 

1.000000 

. float 

0.995185 

.float 

0.980785 

.float 

0.956940 

.float 

0.923880 

.float 

0.881921 

.float 

0.831470 

.float 

0.773010 

.float 

0.707107 

.float 

0.634393 

.float 

0.555570 

.float 

0.471397 

.float 

0.382683 

.float 

0.290285 

[■HPIII 

0.195090 

.float 

0.098017 

HHllll 

0.000000 

.float 

-0.098017 

.float 

-0.195090 

.float 

-0.290285 

.float 

-0.382683 

.float 

-0.471397 

.float 

-0.555570 

.float 

-0.634393 

.float 

-0.707107 

.float 

-0.773010 


12-68 



Software Applications - Application-Oriented Operations 


.float 

-0.831470 

.float 

-0.881921 

.float 

-0.923880 

.float 

-0.956940 

.float 

-0.980785 

.float 

-0.995185 

.float 

-1.000000 

.float 

-0.995185 

.float 

-0.980785 

.float 

-0.956940 

.float 

-0.923880 

.float 

-0.881921 

.float 

-0.831470 

.float 

-0.773010 

.float 

-0.707107 

.float 

-0.634393 

.float 

-0.555570 

.float 

-0.471397 

.float 

-0.382683 

.float 

-0.290285 

.float 

-0.195090 

.float 

-0.098017 

.float 

0.000000 

.float 

0.098017 

.float 

0.195090 

.float 

0.290285 

.float 

0.382683 

.float 

0.471397 

.float 

0.555570 

.float 

0.634393 

.float 

0.707107 

.float 

0.773010 

.float 

0.831470 

.float 

0.881921 

.float 

0.923880 

.float 

0.956940 

.float 

0.980785 

.float 

0.995185 


The radix-2 algorithm has a great tutorial value because it is relatively easy to 
understand how the FFT algorithm functions. However, radix-4 implementa¬ 
tions can increase the speed of the execution by reducing the overall arith¬ 
metic required. Example 12-37 shows the generic implementation of a 
complex, DIF FFT In radix-4. A companion table, like the one in Example 
12-36, should have a value of M equal to the logN, where the base of the lo¬ 
garithm is four. 
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Example 12-37. Complex, Radix-4, DIF FFT 

* 

* TITL COMPLEX, RADIX-4, DIF FFT 

* 

* GENERIC PROGRAM TO DO A LOOPED-CODE RADIX-4 FFT COMPUTATION IN 

* THE TMS320C30. 

* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 117. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY, AND THE COMPUTATION 

* IS DONE IN-PLACE. 

* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION. 

* THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 

* NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE 

* FFT N AND LOG4(N) ARE DEFINED IN A .GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 

* 

* IN ORDER TO HAVE THE FINAL RESULT IN BIT-REVERSED ORDER, THE TWO 

* MIDDLE BRANCHES OF THE RADIX-4 BUTTERFLY ARE INTERCHANGED DURING 

* STORAGE. NOTE THIS DIFFERENCE WHEN COMPARING WITH THE PROGRAM IN 

* P. 117 OF THE BURRUS AND PARKS BOOK. 

* 


.globl FFT ; 

.globl N ; 

.globl M ; 

.globl SINE 

.usect ”IN”,INP,1024 ; 

.text 

* INITIALIZE 


Entry point for execution 

FFT size 

LOG4(N) 

Address of sine table 
Memory with input data 


TEMP .word $+2 

STORE .word FFTSIZ 

.word N 
.word M 
.word SINE 
.word INP 


Beginning of temp storage area 


12 


.BSS FFTSIZ,! 

. BSS LOGFFT,1 

.BSS SINTAB,1 

.BSS INPUT,1 

. BSS STAGE,1 

.BSS RPTCNT,! 

.BSS lEINDX,! 

.BSS LPCNT,1 

.BSS JT,1 

.BSS IA1,1 


FFT size 
LOG4(FFTSIZ) 

Sine/cosine table base 

Area with input data to process 

FFT stage # 

Repeat counter 

IE index for sine/cosine 

Second-loop count 

JT counter in program, P. 117 

lAl index in program, P. 117 
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FFT: 


* 

INITIALIZE 

DATA LOCATIONS 




LDP 

TEMP 

Command to load data page counter 



LDI 

@TEMP,ARO 




LDI 

@STORE,ARl 




LDI 

*ARO++,RO 

Xfer data from one memory to the other 


STI 

R0,*AR1++ 




LDI 

*ARO++;RO 




STI 

R0,*AR1++ 




LDI 

*ARO++,RO 




STI 

R0,*AR1++ 




LDI 

*ARO,RO 




STI 

R0,*AR1 




LDP 

FFTSIZ 

Command to load data page pointer 



LDI 

@FFTSIZ,RO 




LDI 

@FFTSIZ,IRO 




LDI 

@FFTSIZ,IR1 




LDI 

0,AR7 




STI 

AR7,@STAGE 

@STAGE holds the current stage number 


LSH 

1,IR0 

IR0=2*N1 (because of real/imag) 



LSH 

-2,IR1 

IRl=N/4, pointer for SIN/COS table 



LDI 

1,AR7 




STI 

AR7,@RPTCNT 

Initialize repeat counter of first 

loop 


LSH 

-2,R0 




STI 

AR7,@IEINDX 

Initialize IE index 



ADD I 

2,R0 




STI 

RO,@JT 

JT=R0/2+2 



SUBI 

2,R0 




LSH 

1,R0 

R0=N2 


* 

OUTER LOOP 




LOOP: 





LDI 

@INPUT,ARO 

ARO points to X(I) 



ADD I 

R0,AR0,AR1 

ARl points to X(I1) 



ADD I 

R0,AR1,AR2 

AR2 points to X(I2) 



ADDI 

R0,AR2,AR3 

AR3 points to X(I3) 



LDI 

@RPTCNT,RC 




SUBI 

1,RC ] 

RC should be one less than desired 

# 

* 

FIRST LOOP 





RPTB 

BLKl 




ADDF 

*+AR0,*+AR2,Rl 


* 



R1=Y(I)+Y(I2) 



ADDF 

*+AR3,*+ARl,R3 


* 



R3=Y(I1)+Y(I3) 



ADDF 

R3,R1,R6 

R6=R1+R3 



SUBF 

*+AR2,*+AR0,R4 


* 



R4=Y(I)-Y(I2) 



STF 

R6,*+AR0 

Y(I)=R1+R3 



SUBF 

R3,R1 

R1=R1-R3 



LDF 

*AR2,R5 

R5=X(I2) 


11 

LDF 

*+ARl,R7 

R7=Y(I1) 



ADDF 

*AR3,*AR1,R3 

R3=X(I1)+X(I3) 



ADDF 

R5,*AR0,R1 

R1=X(I)+X(I2) 


11 

STF 

R1,*+AR1 

Y(I1)=R1-R3 
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ADDF 

R3,R1,R6 


R6=RH-R3 



SUBF 

R5 *AR0,R2 


R2=X(I)-X(I2) 


11 

STF 

R6,*AR0+-l-(IR0) 

; X(I)=R1+R3 



SUBF 

R3,R1 


R1=R1“R3 



SUBF 

*AR3,*AR1,R6 


R6=X(I1)-X(I3) 



SUBF 

R7,*+AR3,R3 


-R3=Y(I1)~Y(I3) 


11 

STF 

R1 *AR1++(IR0) 

;X(I1)=R1-R3 



SUBF 

R6,R4,R5 


R5=R4-R6 



ADDF 

R6,R4 


R4=R4+R6 



STF 

R5,*+AR2 


Y(I2)=R4-R6 


11 

STF 

R4,*+AR3 


Y(I3)=R4+R6 



SUBF 

R3,R2,R5 


R5=R2-R3 



ADDF 

R3,R2 


R2=R2+R3 


BLKl STF 

R5,*AR2++(IR0) 

; X(I2)=R2-R3 


11 

STF 

R2,*AR3++{IR0) 

; X(I3)=R2+R3 


* 

IF THIS IS 

THE LAST STAGE 

t 

YOU ARE DONE 



LDI 

@STAGE,AR7 





ADD I 

1,AR7 





CMP I 

@L0GFFT,AR7 





BZD 

END 





STI 

AR7,@STAGE 


Current FFT stage 


* 

MAIN INNER 

LOOP 





LDI 

1,AR7 





STI 

AR7,@IA1 


; Init lAl index 



LDI 

2,AR7 


1 



STI 

AR7,@LPCNT 


; Init loop counter for 

inner loop 

INLOP: 






LDI 

2,AR6 


; Increment inner loop < 

counter 


ADDI 

@LPCNT,AR6 





LDI 

@LPCNT,ARO 





LDI 

@IA1,AR7 





ADDI 

@IEINDX,AR7 


; IA1=IA1+IE 



ADDI 

@INPUT,ARO 


; (X(I),Y(I)) pointer 



STI 

AR7,@IA1 





ADDI 

R0,AR0,AR1 

. 

; (X(I1),Y(I1)) pointer 



STI 

AR6,@LPCNT 





ADDI 

R0,AR1,AR2 


; (X(I2),Y(I2)) pointer 



ADDI 

R0,AR2,AR3 


; (X(I3),Y(I3)) pointer 



LDI 

@RPTCNT,RC 





SUBI 

1,RC 


RC should be one less 

than desired 


CMP I 

@JT,AR6 


If LPCNT=JT, go to 



BZD 

SPCL 

i 

special butterfly 



LDI 

@IA1,AR7 





LDI 

@IA1,AR4 





ADDI 

@SINTAB,AR4 


Create cosine index AR4 


ADDI 

AR4,AR7,AR5 





SUBI 

1,AR5 


IA2=IA1+IA1-1 



ADDI 

AR7,AR5,AR6 





SUBI 

1,AR6 


IA3=IA2+IA1-1 
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; SECOND LOOP 



RPTB 

BLK2 



ADDF 

*-rAR2 *+ARO ,R3 

* 



R3=Y(I)+Y(I2) 


ADDF 

*+AR3,*+ARl,R5 

★ 

ADDF 

R5,R3,R6 

R5=Y(I1)+Y(I3) 

R6=R3+R5 


SUBF 

*+AR2 *+AR0,R4 

* 

SUBF 

R5,R3 

R4=Y(I)-Y(I2) 

R3=R3-R5 


ADDF 

*AR2,*AR0,R1 

R1=X(I)+X(I2) 


ADDF 

*AR3,*AR1,R5 

R5=X(I1)-HX(I3) 


MPYF 

R3,*+AR5(IR1) 

R6 R6=R3*C02 

11 

STF 

R6,*+AR0 

Y(I)=R3+R5 


ADDF 

R5,R1,R7 

R7=R1+R5 


SUBF 

*AR2,*AR0,R2 

R2=X(I)-X(I2) 


SUBF 

R5,R1 

R1=R1-R5 


MPYF 

R1,*AR5,R7 

R7=R1*SI2 

11 

STF 

R7,*AR0++(IR0] 

; X(I)=R1+R5 


SUBF 

R7,R6 

R6=R3*C02-R1*SI2 


SUBF 

*+AR3 *+ARl,R5 

* 

MPYF 

R1,*+AR5(IR1) 

R5=Y(I1)-Y(I3) 

R7 ;R7=R1*C02 

11 

STF 

R6,*+AR1 

Y(I1)=R3*C02-R1*SI2 


MPYF 

R3,*AR5,R6 

R6=R3*SI2 


ADDF 

R7,R6 

R6=R1*C02+R3*SI2 


ADDF 

R5,R2,R1 

R1=R2+R5 


SUBF 

R5,R2 

R2=R2-R5 


SUBF 

*AR3,*AR1,R5 

R5=X(I1)-X(I3) 


SUBF 

R5,R4,R3 

R3=R4-R5 


ADDF 

R5,R4 

R4=R4-fR5 


MPYF 

R3,*+AR4(IR1) 

R6 ; R6=R3*C01 

11 

STF 

R6,*ARH-+(IR0) 

; X(I1)=R1*C02+R3*SI2 


MPYF 

R1,*AR4,R7 

R7=R1*SI1 


SUBF 

R7,R6 

R6=R3*C01-R1*SI1 


MPYF 

R1,*+AR4(IR1) 

R6 ; R6=R1*C01 

11 

STF 

R6,*+AR2 

Y(i2)=R3*C01-Rl*SIl 


MPYF 

R3,*AR4,R7 

R7=R3*SI1 


ADDF 

R7,R6 

R6=R1*C01+R3*SI1 


MPYF 

R4,*+AR6(IR1) 

R6 ; R6=R4*C03 

11 

STF 

R6,*AR2++(IR0) 

; X(I2)=R1*C01+R3*SI1 


MPYF 

R2,*AR6,R7 

R7=R2*SI3 


SUBF 

R7,R6 

R6=R4*C03-R2*SI3 


MPYF 

R2,*+AR6(IR1) 

R6 ; R6=R2*C03 

11 

STF 

R6,*+AR3 

Y(i3)=R4*C03-R2*SI3 


MPYF 

R4,*AR6,R7 

R7=R4*SI3 


ADDF 

R7,R6 

R6=R2*C03+R4*SI3 
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12 


BLK2 

* 

STF 

R6,*AR3++(IR0) 

/ 


CMP I 

@LPCNT,RO 


BP 

INLOP ; 


BR 

CONT 

* SPECIAL BUTTERFLY FOR W=J 

SPCL 

LDI 

IR1,AR4 


LSH 

-1,AR4 ; 


ADDI 

@SINTAB,AR4 ; 


RPTB 

BLK3 


ADDF 

*AR2,*AR0,R1 ; 


SUBF 

*AR2,*AR0,R2 ; 

•k 

ADDF 

*+AR2,*+AR0,R3 

•k 

SUBF 

*+AR2,*+AR0,R4 


ADDF 

*AR3,*AR1,R5 


SUBF 

R1,R5,R6 


ADDF 

R5,R1 

k 

ADDF 

*+AR3,*+ARl,R5 


SUBF 

R5,R3,R7 ; 


ADDF 

R5,R3 ; 


STF 

R3,*+AR0 ; 

I I 

STF 

R1,*AR0++(IR0) 


SUBF 

*AR3,*AR1,R1 ; 

k 

SUBF 

*+AR3,*+AR1,R3 


STF 

R6,*+AR1 ; 

I I 

STF 

R7,*AR1++(IR0) 


ADDF 

R3,R2,R5 


SUBF 

R2,R3,R2 


SUBF 

R1,R4,R3 


ADDF 

R1,R4 


SUBF 

R5,R3,R1 


MPYF 

*AR4,R1 


ADDF 

R5,R3 


MPYF 

*R4,R3 

I I 

STF 

R1,*+AR2 


SUBF 

R4,R2,R1 


MPYF 

*AR4,R1 

I I 

STF 

R3,*AR2+-I-{IR0) 


ADDF 

R4,R2 


MPYF 

*AR4,R2 

BLK3 

STF 

R1,*+AR3 

I I 

STF 

R2,*AR3++(IR0) 


CMP I 

@LPCNT,RO 


BPD 

INLOP ; 


x(i3)=R2*C03+R4*SI3 


LOOP BACK TO THE INNER LOOP 


Point to SIN(45) 

Create cosine index AR4=C021 


R1=X(I)+X(I2) 

R2=X(I)-X(I2) 

R3=Y(I)+Y(I2) 

R4=Y(I)-Y(I2) 

R5=X(I1)+X(I3) 

R6=R5-R1 

R1=R1+R5 

R5=Y(I1)+Y(I3) 
R7=R3-R5 
R3=R3+R5 
Y(I)=R3+R5 
} X(I)=R1+R5 

R1=X(I1)“X(I3) 

R3=Y(I1)-Y(I3) 
Y(I1)=R5”R1 
; X(I1)=R3-R5 

R5=R2+R3 
R2=-R2+R3 
R3=R4-R1 
R4=R4+R1 
R1=R3-R5 
R1=R1*C021 
R3=R3+R5 
R3=R3*C021 
Y(I2)=(R3-R5)*C021 
R1=R2-R4 
R1-R1*C021 

; X(I2)=(R3+R5)*C021 

R2=R2+R4 
R2=R2*C021 
Y(I3)=-(R4-R2)*C021 
; X(I3)=(R4+R2)*C021 


Loop back to the inner loop 
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CONT 

LDI 

@RPTCNT,AR7 




LDI 

@IEINDX,AR6 




LSH 

2,AR7 

/ 

Increment repeat counter for 

* 

STI 

AR7,@RPTCNT 

/ 

next time 


LSH 

2,AR6 

/ 

IE=4*IE 


STI 

AR6,@IEINDX 




LDI 

RO,IRO 

# 

N1=N2 


LSH 

-3,R0 




ADD I 

2,R0 




STI 

RO,@JT 

/ 

JT=N2/2+2 


SUBI 

2,R0 




LSH 

1,R0 

} 

N2=N2/4 


BR 

LOOP 

} 

Next FFT stage 

* STORE RESULT 

OUT USING BIT-REVERSED ADDRESSING 

END: 

LDI 

@FFTSIZ,RC 

} 

RC=N 


SUBI 

1,RC 

J 

RC should be one less than desired 


LDI 

@FFTSIZ,IRO 

} 

IRO=size of FFT=N 


LDI 

2,IR1 




LDI 

@INPUT,ARO 




LDP 

STORE 




LDI 

@STORE,ARl 




RPTB 

BITRV 




LDF 

*H-AR0(1) ,R0 



I I 

LDF 

*ARO++(IRO)<, 

R1 


BITRV 

STF 

RO *+ARl(l) 



I I 

STF 

R1,*AR1++(IR1) 


SELF 

BR 

SELF 

. 

Branch to itself at the end. 


. end 





Most often, the data to be transformed is a sequence of real numbers. In this 
case, the FFT demonstrates certain symmetries that permit the reduction of the 
computational load even further. Example 12-38 shows the generic imple¬ 
mentation of a real-valued, radix-2 FFT. For such an FFT, the total number 
of storage required for a length-N transform is only N locations Instead of 2N 
that are necessary In a complex FFT. The rest of the points can be recovered 
based on the symmetry conditions. 
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Example 12-38. Real, Radix-2 FFT 


* TITL REAL, RADIX-2 FFT 

* 

* GENERIC PROGRAM TO DO A RADIX-2 REAL FFT COMPUTATION IN 320C30. 

* 

* THE PROGRAM IS TAKEN FROM THE PAPER BY SORENSEN ET AL., JUNE 1987 

* ISSUE OF THE TRANSACTIONS ON ASSP. 

* 

* THE REAL DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION IS 

* DONE IN-PLACE. THE BIT-REVERSAL IS DONE AT THE BEGINNING OF 

* THE PROGRAM. 

* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA 

* SECTION. THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE 

* THE GENERIC NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE 

* SIZE OF THE FFT N AND LOG2(N) ARE DEFINED IN A .GLOBL DIRECTIVE 

* AND SPECIFIED DURING LINKING. THE LENGTH OF THE TABLE IS 

* N/4 + N/4 = N/2. 

* 

* 

.globl FFT ; Entry point for execution 

.globl N ; FFT size 

.globl M ; LOG2(N) 

.globl SINE ; Address of sine table 

.bss INP,1024 ; Memory with input data 

. text 

* INITIALIZE 

FFTSIZ .word N 

LOGFFT .word M 

SINTAB .word SINE 

INPUT .word INP 

FFT: LDP FFTSIZ ; Command to load data page printer 

* DO THE BIT-REVERSING AT THE BEGINNING 


LDI 

@FFTSIZ,RC 

; RC=N 


SUBI 

1,RC 

; RC should be 

one less than desired 

LDI 

@FFTSIZ,IRO 



LSH 

-1,IR0 

? IRO=half the 

size of FFT=N/2 

LDI 

@INPUT,ARO 



LDI 

@INPUT,AR1 



RPTB 

BITRV 



CMP I 

AR1,AR0 

; Exchange locations only 

BGE 

CONT 

; if AR0<AR1 


LDF 

*ARO,RO 



LDF 

*AR1,R1 



STF 

R0,*AR1 



STF 

R1,*AR0 
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CONT 

NOP 

♦ARO++ 







BITRV 

NOP 

*AR1+-^(IR0)B 







* LENGTH-TWO 

BUTTERFLIES 








LDI 

@INPUT,ARO 


ARO points to X(I) 






LDI 

IRO,RC 


Repeat N/2 times 






SUBI 

1,RC 


RC should be one less 

than 

desired 

# 



RPTB 

BLKl 








ADDF 

*+ARO,*ARO++,RO 






★ 




R0=X(I)+X(I-i'l) 






SUBF 

*AR0,*-AR0,R1 







* 




Rl=X(I)-X(I-Kl) 





BLKl 

STF 

RO,*-ARO 


X(I)=X(I)+X(I+1) 





11 

STF 

R1,*AR0++ 


X(I+1)=X(I)-X(I+1) 





* FIRST PASS 

OF THE DO-20 LOOP 

(STAGE K=2 IN DO-10 LOOP) 





LDI 

@INPUT,ARO 


ARO points to X(I) 






LDI 

2, IRQ 


IR0=2=N2 






LDI 

@FFTSIZ,RC 








LSH 

-2,RC 


Repeat N/4 times 






SUBI 

1,RC 


RC should be one less 

than 

desired 

# 



RPTB 

BLK2 








ADDF 

*+ARO(IRO),*ARO++(IRO),R0 





* 




R0=X(I)+X(I+2) 






SUBF 

*ARO,*-ARO(IRO) 

,R1 





* 




Rl=X(I)-X(I+2) 






NEGF 

*+ARO,RO 


R0=-X(I+3) 





I I 

STF 

RO,*-ARO(IRQ) 


X(I)=X(I)+X(I+2) 





BLK2 

STF 

R1,*AR0++(IR0) 







* 




X(I+2)=X(I)-X(I+2) 





11 

STF 

RO,*+ARO 


X(I+3)=-X(I+3) 





* MAIN 

LOOP 

(FFT STAGES) 








LDI 

@FFTSIZ,IRO 








LSH 

-2,IR0 


IR0=index for E 






LDI 

3,R5 


R5 holds the current 

stage 

number 




LDI 

1,R4 


R4=N4 






LDI 

2,R3 


R3=N2 





LOOP 

LSH 

-1,IR0 


E=E/2 






LSH 

1,R4 


N4=2*N4 






LSH 

1,R3 


N2=2*N2 





* INNER LOOP 

(DO-20 LOOP IN 

CHE PROGRAM) 






LDI 

@INPUT,AR5 


AR5 points to X(I) 





INLOP 

LDI 

IR0,AR0 








ADD I 

@SINTAB,ARO 


ARO points to SIN/COS 

table 


■ 19 


LDI 

R4,IR1 


IR1=N4 






LDI 

AR5,AR1 








ADD I 

1,AR1 


ARl points to X(I1)=X(I+J) 





LDI 

AR1,AR3 
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12 



ADD I 

R3,AR3 

AR3 points to X(13)=X(1+J+N2) 


LDI 

AR3,AR2 



SUBI 

2,AR2 

AR2 points to X(12)=X(I-J+N2) 


ADD I 

R3,AR2,AR4 

AR4 points to X(14)=X(I-J+Nl) 


LDF 

*AR5++(IR1),R0 

* 



RO=X(I) 


ADDF 

*+AR5(IRl),R0, 

R1 

* 



R1=X(I)+X(I+N2) 


SUBF 

R0,*++AR5(IR1) 

,R0 

* 



R0=-X(I)+X(I+N2) 

11 

STF 

R1,*-AR5(IR1) 

X(I)=X(I)+X(I+N2) 


NEGF 

RO 

R0=X(I)-X(I+N2) 


NEGF 

*+-l-AR5(IRl) ,R1 

* 



R1=-X(I+N4+N2) 

11 

STF 

RO,*AR5 

X(I+N2)=X(I)-X(I+N2) 


STF 

R1,*AR5 

X (I-HN4+N2 ) =-X (I+N4+N2 ) 


* 

INNERMOST LOOP 



LDI 

@FFTSIZ,IR1 



LSH 

-2,IR1 

IRl=separation between SIN/COS 


LDI 

R4,RC 



SUBI 

2,RC 

Repeat N4-1 times 


RPTB 

BLK3 



MPYF 

*AR3 *+ARO(IRl),R0 

* 



RO=X(I3)*COS 


MPYF 

*AR4,*AR0,R1 

R1=X(I4)*SIN 


MPYF 

*AR4 *+ARO(lRl),R1 ; Rl=X(I4)*COS 

11 

ADDF 

R0,R1,R2 

R2=X(I3)*C0S+X(I4)*SIN 


MPYF 

*AR3,*ARO++(IRO),RO 

* 



R0=X(I3)*SIN 


SUBF 

R0,R1,R0 

R0=-X(I3)*SIN+X(I4)*COS 


SUBF 

*AR2,R0,R1 

R1=-X(I2)+R0 


ADDF 

*AR2,R0,R1 

R1=X(I2)+R0 

11 

STF 

R1,*AR3++ 

X(I3)=-X(I2)+R0 


ADDF 

*AR1,R2,R1 

R1=X(I1)+R2 

! 1 

STF 

R1,*AR4— 

X(I4)=X(I2)+R0 


SUBF 

R2,*AR1,R1 

R1=X(I1)--R2 

1 1 

STF 

R1,*AR1+-H 

X(I1)=X(I1)+R2 

BLK3 

STF 

R1,*AR2 — 

X(I2)=X(I1)“R2 


SUBI 

@INPUT,AR5 



ADD I 

R3,AR5 

AR5=i+Nl 


CMP I 

@FFTSIZ,AR5 



BLED 

INLOP 

Loop back to the inner loop 


ADD I 

@INPUT,AR5 



NOP 




NOP 




ADD I 

1,R5 



CMP I 

@LOGFFT,R5 



BLE 

LOOP 


END 

BR 

END 

Branch to itself at the end. 


. end 


12-78 






Software Applications - Application-Oriented Operations 


12.4.5 


Table 12-1 summarizes the execution time required for FFT lengths between 
64 and 1024 points for the three algorithms in Examples 12-35, 12-37, and 
12-38. As can be seen, the TMS320C30 permits very fast execution of such 
transforms. FFT lengths up to 1024 points (complex) or 2048 points (real), 
covering the majority of applications, can be executed almost entirely In the 
on-chip memory. 

Table 12-1. TMS320C30 FFT Timing Benchmarks 


NUMBER 

OF 

POINTS 

RADIX-2 

(complex) 

FFT TIMING 
(in milliseconds) 
RADIX-4 
(complex) 

RADIX-2 

(real) 

64 

0.167 

0.123 

0.075 

128 

0.367 

- 

0.162 

256 

0.801 

0.624 

0.354 

512 

1.740 

- 

0.771 

1024 

3.750 

3.040 

1.670 


Lattice Filters 

The lattice form is an alternative way of Implementing digital filters, and it has 
found applications in speech processing, spectral estimation, and other areas. 
In the present discussion, the notation and the terminology from speech pro¬ 
cessing applications will be used. 

If H(z) is the transfer function of a digital filter that has only poles, A(z) = 
1/H(z) will be a filter having only zeros and It will be called the inverse filter. 
The inverse lattice filter Is shown In Figure 12-5. In mathematical terms, it Is 
described by the equations: 

f{Ln) = f{i-^,n) + k{i) 

b{in) = /?(/-1,A7-1)+ k(i) f(/-1,A?) 

Initial conditions: 

f{Q,n) = b{Q,n) = x{n) 

Final conditions: 

y(n) = f{p,n). 


f{i,n) is called the forward error, b{/,n) backward error, k{i) Is the /-th re¬ 
flection coefficient, x(n) is the Input, and y{n) the output signal. The order 
of the filter (i.e., the number of stages) is p. In the linear predictive coding 
(LPC) method of speech processing, the inverse lattice filter is used during 
analysis, and the (forward) lattice filter during speech synthesis. 
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Figure 12-5. Structure of the Inverse Lattice Filter 

Figure 12-6 shows the data memory organization of the inverse lattice-filter 
on the TMS320C30. 


reflection 


low 

address 

k(1) 


k(2) 


e 


• 


• 

high 

address 

1 ■‘■p’ 1 


backward 
propagation terms 

b(0, n-1) 
b(1, n-1) 




b(p-l. n-1) 


Figure 12-6. Data Memory Organization for Lattice Filters 
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Example 12-39. Inverse Lattice Filter 

* TITL INVERSE LATTICE FILTER 

* 

* 

* SUBROUTINE L A T I N V 

* 

* LATINV == LATTICE FILTER (LPC INVERSE FILTER - ANALYSIS) 

* 

* 

* TYPICAL CALLING SEQUENCE: 

* 


* 

load 

R2 

* 

load 

ARO 

* 

load 

ARl 

* 

load 

RC 

* 

CALL 

LATINV 


* 
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* 2 <= i <= p 

* 

LOOP 

*AR0,*++AR1(1),R0 ; k(i) * b(i-l,n-l) -> RO 

R2,R0,R2 ; f(+k(i-1) 

; *b( 

; = f(i-l,n) -> R2 

* 

* ; b(i-l-l,b-l)+k(i-l)*f 

ADDF3 *-ARl{l), Rl, R3 ; = b(i-l,n) -> R3 

II STF R3, *-ARl(l) ; b(i-l-l,n) -> b(i-l-l,n-l) 

* 

LOOP MPYF3 *AR0++(1),R2,R1 

* ; k(i) * f(i-l,n) -> R1 

* 

* I = P+1 (CLEANUP) 

ADDF3 R2,R0,R2 

* 

★ 

* 

ADDF35 ARl, Rl, R3 

I I STF R3, *AR1 

* 

* RETURN SEQUENCE 

* 

RETS 

* 

* end 

* 

. end 

The (forward) lattice filter has a structure very similar to the inverse filter, as 
shown In Figure 12-7. The corresponding equations describing the lattice 
filter are: 

/(/-I ,A7) = f{hn) - k(i) bU-^ ,nA) 
b{hn) = )+ k{i) f( /-1,/7) 

Initial conditions: 

f{p,n) = x( /?), b{hn-^ ) = 0 for /=1 
Final conditions: 
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y{n) = f{0,n). 

The data memory organization is identical to the one of the inverse filter, as 
shown In Figure 12-6. Example 12-40 is the implementation of the lattice 
filter on the TMS320C30. 


; f(p-l,n)+k(p)*b(p-l,n-l) 
; = f(p,n) -> R2 

; b(p-l,n-l)+k(p)*f(p-l,n) 

; = b(p,n) -> R3 

; b(p-l,n) -> b(p-l,n-l) 


; RETURN 


RPTB 
MPYF3 
I I ADDF3 

* 

* 
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x(n)=f(p. n) f(2.n) f(1,n) y(n) 



Figure 12-7. Structure of the (Forward) Lattice Filter 
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Example 12-40. Lattice Filter 

* TITL LATTICE FILTER 

* 

* 

* SUBROUTINE L A T I C E 


LOAD ARO 
LOAD ARl 
LOAD RC 
CALL LATICE 


ARGUMENT ASSIGNMENTS: 

ARGUMENT | FUNCTION!unction 
-+- 


* 

R2 

I F(P,N) 

= E(N) = EXCITATION 


* 

ARO 

I ADDRESS 

OF FILTER COEFFICIENTS 

(K(P) ) 

* 

ARl 

I ADDRESS 

OF BACKWARD PROPAGATION 

VALUES 

* 

RC 

RC = P ■ 

- 2 



REGISTERS USED AS INPUT: R2, ARO, ARl, RC 

REGISTERS MODIFIED: RO, Rl, R2, R3, RS, RE, RC, ARO, ARl 
REGISTER CONTAINING RESULT: R2 (f(0,n)) 

STACK USAGE: NONE 

PROGRAM SIZE: 12 WORDS 

EXECUTION CYCLES: 13 + 5 * (P-1) 

.global LATICE 


LATICE MPYF3 
* 

SUBF3 


2 <= I <= P 



RPTB 

MPYF3 

MPYF3 

ADDF3 


STF 

SUBF3 


*AR0, *AR1, RO 
R0,R2,R2 


LOOP 

*AR0,R2,R1 
*--AR0(l), 


*AR1--(1) , Rl 


R3, *+ARl(2) 
R0,R2,R2 


K(P) * B(P--1,N“1) -> RO 
ASSUME F(P,N) -> R2. 
F(P,N)~K(P)*B(P-1,N-1) 
=F(P“1,N) -> R2 


; K(I) * F(I-1,N) -> Rl 
*-ARl(l), RO 

K(I-l) * B(I-1--1,N-1) -> R 
R3 

B(I-1,N-1) + K(I) * F(I-1,N) 
= B(I,N) -> R3 
B(I,N) -> B(I,N-1) 
F(I-1,N)-K(I~1)*B(I-1-1,N-1) 
= F(I-1~1,N) -> R2 
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* 1=1 (CLEANUP) 

* 



MPYF3 

*ARO, R2, 

Rl ; 

K(l) * F(0,N) -> Rl 


ADDF3 

*AR1, Rl, 

R3 ; 

B(0,N-1) + K(l) * F(0,N) 

* 



} 

= B(1,N) -> R3 


STF 

R3, *+ARl(l) ; 

B(1,N) -> B(1,N~1) 

11 
•k 

STF 

R2, *AR1 

r 

F(0,N) -> B(0,N-1) 

* 

* 

RETURN SEQUENCE 




RETS 


r 

RETURN 
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12.5 Programming Tips 

Programming style is highly personal, and reflects each individuaTs prefer¬ 
ences and experiences. The purpose of this section is not to impose any par¬ 
ticular style. Instead, it intends to emphasize some of the features of the 
TMS320C30 that can help in producing faster and/or shorter programs. The 
following covers both C compiler and assembly language programming. 

12.5.1 C-Callable Routines 

The TMS320C30 was designed with a high-level language (HLL) in mind. 
The large register file, the software stack and the large memory space makes 
implementation of a HLL compiler an easy task. The first such implementation 
supplied is a C compiler. Use of the C compiler Increases the transportability 
of applications that have been tested on large, general-purpose computers, 
and decreases their porting time. 

For best usage of the compiler: 

1) Write the application in the high-level language. 

2) Debug the program. 

3) Estimate if It runs in real-time. 

4) If not, identify places where most of the execution time is spent. 

5) Optimize these areas by writing assembly language routines Imple¬ 
menting the functions. 

6) Call the routines from the C program as C functions. 

When writing a C program, a simple way to increase the execution speed is 
to maximize the use of register variables. For more information, refer to the 
TI\/IS320C30 C Compiler Reference Guide. 

There are certain conventions that have to be observed in writing a C-callable 
routine. These conventions are outlined in the Runtime Environment chapter 
of the "TMS320C30 C Compiler Reference Guide". Certain registers are saved 
by the calling function and others need to be saved by the called function. 
The C compiler manual will help achieve a very clean Interface. The end result 
Is the readability and natural flow of a high-level language combined with the 
efficiency and special-feature use of assembly language. 

12.5.2 Hints for Assembly Coding 

Each program will have its particular requirements. Not all possible optimiza¬ 
tions will make sense in every case. The suggestions presented in this section 
can be used as a checklist of available software tools. 

• Use delayed branches. Delayed branches take a single cycle to ex¬ 
ecute, regular branches take four. The following three instructions are 
also executed no matter if the branch is taken or not. If there are less 
than three instructions that could be used, use the delayed branch and 
append NOPs. Machine cycles (time) are still being saved. 

• Apply the repeat single/block construct. In this way, loops are 
achieved with no overhead. Nesting such constructs normally will not 
Increase efficiency, so try to use the repeat feature on the most often 
performed loop. Note that RPTS Is not interruptible, and the executed 
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instruction is not refetched for execution. This frees the buses for op¬ 
erands. 

• Use parallel instructions. It is possible to have a multiply in parallel 
with an add (or subtract), and stores in parallel with any multiply or 
ALU operation. This increases the number of operations executed in a 
single cycle. For maximum efficiency, observe the addressing modes 
used in parallel Instructions and arrange the data appropriately. 

• Maximize the use of registers. The registers are a very efficient, 
easy way to access scratch-pad memory. Extensive use of the register 
file will also help when using parallel Instructions and in avoiding the 
pipeline conflicts encountered using the registers in addressing modes. 

• Use the cache. Especially in conjunction with external slow memory. 
The cache is transparent to the user, so make sure that it is enabled. 

• Use internal memory instead of external memory. The internal 
memory (2K x 32 bits RAM and 4K x 32 bits ROM) Is considerably 
faster to access. In a single cycle, two operands can be brought from 
Internal memory. A way of maximizing performance is to use the DMA 
in parallel with the CPU to transfer data to Internal memory before op¬ 
erating on them. 

• Avoid pipeline conflicts. If there is no problem with program speed, 
ignore this suggestion. For time-critical operations, make sure that cy¬ 
cles are not missed because of conflicts. The way to identify such 
conflicts is to run the trace function on the development tools (simula¬ 
tor, emulators) with the program tracing option enabled. The tracing 
will Identify immediately the pipeline conflicts. Consulting the appro¬ 
priate section of this User's Guide will explain the reason for the con¬ 
flict. Steps can then be taken to correct the problem. 

The above checklist Is not exhaustive, and it does not address the more de¬ 
tailed features outlined In the different sections of this manual. To exploit the 
full power of the TMS320C30 It Is recommended that the architecture, hard¬ 
ware configuration, and Instruction set of the device, described in earlier 
chapters, be carefully studied. 
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Section 13 


Hardware Applications 


The TMS320C30's advanced interface design allows this device to be used to 
implement a wide variety of system configurations. Its two external buses and 
DMA capability provide a parallel 32-bit interface to byte- or word-wide de¬ 
vices, while the interrupt interface, dual serial ports, and general purpose dig¬ 
ital I/O provide communication with a multitude of peripherals. 

This section describes how to use the TMS320C30's interfaces to connect to 
various external devices. Specific discussions include Implementation of par¬ 
allel interface to devices with and without wait states, use of DMA and general 
purpose I/O, and multiprocessing considerations. 

Major topics discussed in this section are as follows: 

• System Configuration Options Overview (Section 13.1 on page 13-2) 

• Primary Bus Interface (Section 13.2 on page 13-4) 

- Zero Wait State Interface to RAMs 

- Ready Generation 

“ Bank Switching Techniques 

• Expansion Bus Interface (Section 13.3 on page 13-14) 

• System Control Functions (Section 13.4 on page 13-18) 

- Clock Oscillator Circuitry 

- Reset Slg^ial Generator 

• User Target Design Considerations When Using the XDS1000 (Section 
13.5 on page 13-22) 
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13.1 System Configuration Options Overview 

The various TMS320C30 interfaces allow connections to a wide variety of 
different device types. Each of these interfaces is tailored to a particular family 
of devices. 

13.1.1 Categories of Interfaces on the TMS320C30 

The interface types on the TMS320C30 fall Into several different categories 
depending on the devices to which they were Intended to be connected. Each 
interface comprises one or more signal lines which transfer information and 
control Its operation. Shown in Figure 13-1 are the signal line groupings for 
each of these various interfaces. 
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System 

Control 


< 
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Control 
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DMA <: 
Interface ^ 
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External 

Flags 
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Clock 
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XDO-31 
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Figure 13-1. External Interfaces on the TMS320C30 
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All of the interfaces are independent of one another and different operations 
may be performed simultaneously on each Interface. 

The Primary and Expansion buses Implement the memory mapped Interface to 
the device. The external DMA interface allows external devices to cause the 
processor to relinquish the Primary bus and allow direct memory access. 
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13.1.2 Typical System Block Diagram 

The devices which can be interfaced to the TMS320C30 include memory, 
DMA devices, and numerous parallel and serial peripherals and I/O devices. 
Figure 13-2 illustrates a typical configuration of a TMS320C30 system 
showing different types of external devices and the interfaces to which they 
are connected. 
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Figure 13-2. Possible System Configurations 


This block diagram constitutes more or less a fully expanded system. In an 
actual design any subset of the illustrated configuration may be used. 
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13.2 Primary Bus interface 

The primary bus is used by the TMS320C30 to access the majority of its me¬ 
mory mapped locations. Therefore, typically when a large amount of external 
memory Is required In a system, it is interfaced to the primary bus. The ex¬ 
pansion bus (discussed In the next subsection) actually comprises two mu¬ 
tually exclusive Interfaces, controlled by the MSTRB and lOSTRB signals 
respectively. Cycles on the expansion bus controlled by the MSTRB signal are 
Identical in timing to cycles on the primary bus, with the exception that bank 
switching is not Implemented on the expansion bus. Accordingly, the dis¬ 
cussion of primary bus cycles in this section applies equally to MSTRB cycles 
on the expansion bus. 

Although both the primary bus and the expansion bus may be used to inter¬ 
face to a wide variety of devices, the devices most commonly interfaced to 
these buses are memories. Therefore, detailed examples of memory interface 
will presented In this subsection. 

13.2.1 Zero Wait-State Interface To RAMs 

For full speed, zero wait-state Interface to any devices, the TMS320C30 re¬ 
quires a read access time of 35 ns from address stable to data valid. Since, for 
most memories, access time from chip select Is the same as access time from 
address. It is theoretically possible to use 35 ns memories at full speed with 
the TMS320C30. This, however, dictates that there be no delays present be¬ 
tween the processor and the memories. This Is usually not the case In practice, 
due to interconnection delays and the fact that typically some gating is re¬ 
quired for chip select generation. Therefore, slightly faster memories are gen¬ 
erally required In most systems. If one level of reasonably high-speed (below 
10 ns In propagation delay) gating is used to generate chip select for the me¬ 
mories, 25 ns devices may be used. 

Among currently available RAMs, there are two distinct categories of devices 
with different Interface characteristics. These two categories are RAMs with¬ 
out output enable control lines (OE), which include the 1 -bit wide organized 
RAMs and most of the 4-bit wide RAMs, and those with ^ controls, which 
Include the byte wide and a few of the 4-bit wide RAMs. Many of the fastest 
RAMs do not provide OE control, and use chip select (CS) controlled write 
cycles to insure that data outputs do not turn on for write operations. In CS 
controlled write cycles, the write control line (WE) goes low prior to ^ going 
low, and internal logic holds the outputs disabled until the cycle is completed. 
Using ^ controlled write cycles Is an efficient way to interface fast RAMs 
without OE controls to the TMS320C30 at full speed. 

Figure 13-3 shows the TMS320C30 interfaced to Cypress Semiconductor's 
CY7C164 25 ns 16k x 4-bit CMOS static RAMs with zero wait states using 
^ controlled write cycles. These RAMs are arranged to Implement 16k 32-blt 
words located at addresses OOOOOH thru 03FFFH, which are the first 16k 
words in external memory. Note that In Figure 13-3 the RDY input is tied low, 
selecting zero wait states for all accesses on the bus. 
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Figure 13-3. Ram Interface - No OE 


In this circuit, chip select Is generated from STRB and A23 using a 74AS32, 
whose propagation delay Is only 5.8 ns. Thus, the chip select delay added to 
the RAM's 25 ns chip select access time satisfies the TMS320C30's 35 ns read 
access time from address. This approach works well if only a single bank of 
external memory is Implemented where the chip select decode can be accom¬ 
plished In only one level of gating. If more than one bank is required to Im¬ 
plement very large memory spaces, bank switching can be used to provide for 
multiple bank select generation while still maintaining full speed accesses 
within each bank. Bank switching is discussed In detail In a later subsection. 
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Figures 13-4 and 13-5 show the read ar^d write timings of this interface, re¬ 
spectively. For read operations, WE (R/W) is inactive (high), and the device 
is selected whenever both STRB and A23 are low. The total time from address 
to data from the RAM is therefore: 

tacc = t1 + t2 = 5.8 + 25 = 30.8 ns 

This easily meets the TMS320C30's 35 ns access time requirement. For write 
operations, address and R/W change state far enough away in time from the 
low STI^ pulse to allow this interface to easily meet specifications for most 
RAMs' CS controlled write cycles. In this case, the CY7C164s outputs disable 
at the beginning of the cycle well early enough (t1 =7 ns) to avoid bus con¬ 
tention with the TMS320C30. Data is then driven into the ^Ms as STRB 
goes low. The RAMs require 13 ns of write data setup prior to CS going high, 
and this design provides around 65 ns (t2). A data hold time of 0 ns (t3) is 
required by the RAMs, and this design provides greater than 10 ns. Finally, 
the RAMs setup and hold times for address with respect to CS of 0 ns are also 
met with a clear margin. 

Some RAMs with OE controls can also use ^ controlled wite cycles and this 
Interface may be used with some of these devices with OE tied low. There 
are, however, two requirements for the use of OE RAMs with this interface. 
First, the RAM's OE Input must be gated with chip select and WE internally 
so that the device's outputs do not turn on unless a read is being performed. 
Secondly, the RAM must allow address inputs to change while WE is low, 
which some RAMs specifically prohibit. 

Many RAMs with OE controls that do not meet the design criteria for the cir¬ 
cuit shown in Figure 13-3 may be Interfaced to the TMS320C30 using the 
approach shown in Figure 13-6 



Figure 13-6. RAM Interface - OE 
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This design shows an interface to Integrated Device Technology's IDT7198 
25 ns 16k X 4-bit CMOS static RAMs using M to enable and disable the data 
outputs. 

In this circuit chip select is driven directly by a single address line, which lo¬ 
cates the RAM at addresses OOOOOH through 03FFFH In external memory. 
The RAM's WE input is generated by AN Ding R/W and STRB, and therefore 
WE goes low after ^ only during write cycles. This satisfies the RAM's re¬ 
quirement that address never changes when WE is low. 

The timing of read operations, shown in Figure 13-7, is very straightforward 
since ^ is driven directly. The read access time of the circuit t1, is therefore 
simply the RAM's chip select/address access time, which is 25 ns. This pro¬ 
vides 10 ns of margin over the TMS320C30's 35 ns requirement. 
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Figure 13-7. Read Operations Timing 
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Figure 13-8. Write Operations Timing 


During write operations, as shown in Figure 13-8, the RAM's outputs are 
disabled after a delay of t1 following R/W going low. This delay comprises 
the inverter propagation delay and the RAM's turn-off delay, therefore t1 is 
given by; 

t1 = 5 + 15 = 20 ns 


which results In the outputs being disabled no later than the falling edge of 
HI, thereby avoiding bus contention with the TMS320C30. The circuit's data 
setup and hold times of approximately 65 and 10 ns, respectively also easily 
meet the RAM's timing requirements. 

As with the circuit of Figure 13-3, If more complex chip select decode is re¬ 
quired than can be accomplished in time to meet zero wait state timing, wait 
states or bank switching techniques (discussed in a later subsection) should 
be used. 


It should be noted that the IDT7198's OE control Is gated with ^ internally, 
therefore the RAM's outputs are not enabled unless the device is selected. 
This is critical if there are any other devices connected to the same bus; if there 
are no other devices connected to the bus, then OE need not be gated Inter¬ 
nally with chip select. 
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13.2.2 Ready Generation 

The use of wait states can greatly increase system flexibility and reduce hard¬ 
ware requirements over systems without wait state capability. The 
TI\/IS320C30 has the capability of generating wait states on either the primary 
bus or the expansion bus; both buses have independent sets of ready control 
logic. Ready generation is discussed in this subsection from the perspective 
of the primary bus interface, however, wait state operation on the expansion 
bus is identical to that of the primary bus, therefore these discussions pertain 
equally well to expansion bus operation. Ready generation will not be in¬ 
cluded in the specific discussions of the expansion bus interface. 

Walt states are gener ated on the basis of the internal wait state generator, the 
external ready input (RDY), or the logical AND or OR of the two (see Section 
8.3). When enabled. Internally generated wait states effect all external cycles, 
regardless of the address accessed. If different n umbe rs of wait states are re¬ 
quired for various external devices, the external RDY input may be used to 
tailor wait state generation to specific system requirements. 

If the logical OR (or electrical AND since the signals are low true) of the ex¬ 
ternal and wait count ready signals Is selected, the earlier of either of the two 
signals will generate a ready condition and allow the cycle to be completed. 
It Is not required that both signals be present. 

The OR of the two ready signals can be used to implement wait states for 
devices which require a greater number of wait states than are Implemented 
with external logic (up to eight). This feature Is useful, for example, if a sys¬ 
tem contains some fast and some slow devices. In this case, fast devices can 
generate ready externally with a minimum of logic, and slow devices can use 
the internal wait counter for larger numbers of wait states. Thus, when fast 
devices are accessed, the external hardware responds promptly with ready 
which terminates the cycle. When slow devices are accessed, the external 
hardware does not respond, and the cycle Is appropriately terminated after the 
Internal wait count. 

The OR of the two ready signals may also be used if conditions occur which 
require termination of bus cycles prior to the number of wait states Imple¬ 
mented with external logic. In this case, a shorter wait count is specified in¬ 
ternally than the number of wait states Implemented with the external ready 
logic, and the bus cycle Is terminated after the wait count. This feature may 
also be used as a safeguard against Inadvertent accesses to nonexistent me¬ 
mory which would never respond with ready and therefore lock up the 
TMS320C30. 

If the OR of the two ready signals is used, however, and the internal wait state 
count is less than the number of wait states implemented externally, the ex¬ 
ternal ready generation logic must have the ability to reset its sequencing to 
allow a new cycle to begin immediately following the end of the internal wait 
count. This requires that, under these conditions, consecutive cycles must be 
from independently decoded areas of memory and that the external ready 
generation logic be capable of restarting Its sequence as soon as a new cycle 
begins. Otherwise, the external ready generation logic may loose synchroni¬ 
zation with bus cycles and therefore generate Improperly timed wait states. 
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If the logical AND (electrical OR) of the wait count and external ready signals 
is selected, the later of the two signals will control the internal ready signal, 
but both signals must occur. Accordingly, external ready control must be im¬ 
plemented for each wait state device in addition to the wait count ready signal 
being enabled. 

This feature is useful if there are devices in a system which are equipped to 
provide a ready signal but cannot respond quickly enough to meet the 
TMS320C30's timing requirements. In particular, if these devices normally 
indicate a ready condition and, when accessed, respond with a wait until they 
become ready, the logical AND of the two ready signals can be used to save 
hardware in the system. In this case, the internal wait counter can be used to 
provide wait states initially, and become ready after the external device has 
had time to send a not ready indication. The internal wait counter then re¬ 
mains ready until the external device also becomes ready, which terminates the 
cycle. 

Additionally, the AND of the two ready signals may be used for extending the 
number of wait states for devices which already have external ready logic im¬ 
plemented but require additional wait states under certain unique circum¬ 
stances. 

In the Implementation of external ready generation hardware, the particular 
technique employed depends heavily on the specific characteristics of the 
system. The optimum approach to ready generation varies depending on the 
relative number of wait state and non-wait state devices in the system and the 
maximum number of wait states required for any one device. The approaches 
discussed here are intended to be general enough for most applications, and 
are easily modifiable to comprehend many different system configurations. 

In general, ready generation Involves the following three functions: 

1) Segmentation of the address space In some fashion to distinguish fast 
and slow devices. 

2) Generate properly timed ready Indications. 

3) Logically ORing all of the separate ready timing signals together to 
connect to the physical ready input. 

Segmentation of the address space Is required so that a unique Indication of 
each of the particular areas within the address space that require wait states 
can be obtained. This segmentation is commonly implemented in a system in 
the form of chip select generation. Chip select signals may be used to initiate 
wait states in many cases, however, occasionally chip select decoding con¬ 
siderations may provide signals which will not allow ready Input timing re¬ 
quirements to be met. In this case, coarse address space segmentation may 
be made on the basis of a small number of address lines, where simpler gating 
allows signals to be generated more quickly. In either case, the signal indi¬ 
cating that a particular area of memory is being addressed is normally used to 
Initiate a ready or wait state indication. 

Once the region of address space being accessed has been established, a 
timing circuit of some sort is normally used to provide a ready indication to the 
processor at the appropriate point in the cycle to satisfy each device's unique 
requirements. 
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Finally, since indications of ready status from multiple devices are typically 
present, an OR gate is commonly used to combine the signals to drive the RDY 
input 

One of two basic approaches may be taken in the implementation of ready 
control logic depending upon the state in which the ready input is to be be¬ 
tween accesses. If RDY is low between accesses, the processor is always 
ready unless a wait state is required; if RDY is high between accesses, the 
processor will always enter a wait state unless a ready Indication Is generated. 

If RDY is low between accesses, control of full speed devices Is straightfor¬ 
ward; no action is necessary since ready is always active unless otherwise re¬ 
quired. Devices requiring wait states, however, must drive ready high fast 
enough to meet the input timing requirements. Then, after an appropriate de¬ 
lay, a ready indication must be generated. This can be quite difficult in many 
circumstances since wait state devices are Inherently slow and often require 
complex select decoding. 

If RDY is high between accesses, zero wait state devices, which tend to be 
inherently fast, can usually respond immediately with a ready indication. Walt 
state devices may simply delay their select signals appropriately to generate a 
ready. Typically, this approach results in the most efficient Implementation of 
ready control logic. Figure 13-9 shows a circuit of this type which can be used 
to generate 0, 1, or 2 wait states for multiple devices in a system. 



Hardware Applications - Primary Bus Interface 


74ALS138 



Figure 13-9. Circuit For Generation of 0, 1, or 2 Wait States For 

Multiple Devices 


In this circuit full speed devices drive ready directly through the 74AS21, and 
the two flip-flops delay wait state devices' select signals one or two H1 cycles 
to provide 1 or 2 wait states. 

Considering the TMS320C30's ready delay time of 8 ns following address, 
zero wait state devices must use ungated address lines directly to drive the 
input of the 74AS21, since this gate contributes a maximum propagation delay 
of 6 ns to the RDY signal. Thus, zero wait state devices should be grouped 
together within a coarse segmentation of address space if other devices In the 
system require wait states. 

With this circuit, devices requiring wait states may take up to 42 ns from a 
valid address on the TMS320C30 to provide inputs to the 74AS20's Inputs. 
Typically, this allows sufficient time for any decoding required In generating 
select signals for slower devices in the system. For example, the 74ALS138 
driven by address and STRB, can generate select decodes in 22 ns, which ea¬ 
sily meets the TMS320C30's timing requirements. 
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With this circuit unused inputs to either the 74AS20s or the 74AS21 should 
be tied to a logic high level to prevent noise from generating spurious wait 
states. 

If more than 2 wait states are required by devices within a system, other ap¬ 
proaches may be employed for ready generation. If between three and eight 
wait states are required, additional flip-flops may be Included, In the same 
manner as shown in Figure 13-9, or internally generated wait states may be 
used in conjunction with external hardware. If greater than eight wait states 
are required, an external circuit using a counter may be used to supplement the 
internal wait state generators capabilities. 

13.2.3 Bank Switching Techniques 

The TMS320C30's programmable bank switching feature can greatly ease 
system design when large amounts of memory are required. This feature is 
used to provide a period of time during which all device selects are disabled 
that would not normally be present otherwise (refer to Section 8.4 for further 
information regarding bank switching). During this interval, slow devices are 
allowed time to turn off before other devices have the opportunity to drive the 
data bus, thus avoiding bus contention. 

When bank switching Is enabled, any time a portion of the high order address 
lines change, as defined by the contents of the BNKCMPR register, STRB goes 
high for one full HI cycle. Provided STRB Is included in chip select decodes, 
this causes all devices to be disabled during this period. The next bank of 
devices is not enabled until STRB goes low again. 

Bank switching Is not required during writes since these cycles always exhibit 
an Inherent one-half HI cycle setup of address Information before STRB goes 
low. Thus, when using bank switching for read/write devices, a minimum of 
half of one HI cycle of address setup Is provided for all accesses. Therefore, 
large amounts of memory can be implemented without wait states or extra 
hardware required for Isolation between banks. Also, note that access time for 
cycles during bank switching Is the same as that of cycles without bank 
switching, and accordingly, full speed accesses may still be accomplished 
within each bank. 

The circuit shown in Figure 13-10 illustrates the use of bank switching with 
Cypress Semiconductor's CY7C185 25 ns 8kx8 CMOS static RAM. This cir¬ 
cuit implements 32k 32-bit words of memory with full speed zero wait state 
accesses within each bank. 
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TMS320C30 



Figure 13-10. Bank Switching For Cyprus Semiconductors CY7C185 


Each of the four banks in this circuit is selected using a decode of A15-A13 
generated by the 74AS138. With the BNKCMPR register set to >0Bh, the 
banks will be selected on even 8k word boundaries starting at location zero 
in external memory space. 

This circuit could not have been implemented without bank switching, since 
data output's turn-on and turn-off delays would have caused bus conflicts, 
and full speed accesses do not allow enough time for chip select decoding for 
the four banks. Here, the propagation delay of the 74AS138 is only involved 
during bank switches, where there is sufficient time between cycles to allow 
new chip selects to be decoded. 

The timing of this circuit for read operations using bank switching is shown 
in Figure 13-11 With the BNKCMPR register set to >0Bh, when a bank 
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switch occurs, the bank address on address lines A23-A13, is updated during 
the extra HI cycle while STRB is high. Then, after chip select decodes have 
stabilized, and the previously selected bank has disabled its outputs, STRB 
goes low for the next read cycle. Further accesses occur at full speed with the 
normal bus timings, as long as another bank switch Is not necessary. Write 
cycles do not require bank switching due to the inherent address setup pro¬ 
vided in their timings. 
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Figure 13-11. Timing For Read Operations Using Bank Switching 


This timing is summarized In Table 13-1. 

Table 13-1. Bank Switching Interface Timing 


Time 

Interval 

Event 

Time 

Period 

ti 

Hi falling to address/STRB valid 

10 ns 

t2 

STRB to select delay 

4.5 ns 

t3 

Memory disable from select 

15 ns 

t4 

HI falling to STRB 

10 ns 

t5 

STRB to select delay 

4.5 ns 

t6 

Memory output enable delay 

3 ns 
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13.3 Expansion Bus Interface 

The TMS320C30s expansion bus interface provides a second complete paral¬ 
lel bus which can be used to implement data transfers concurrently with, and 
independent of, operations on the primary bus. The expansion bus comprises 
two mutually exclusive interfaces controlled by the MSTRB and lOSTRB sig¬ 
nals, respectively. These two signals are activated depending on what section 
of the memory space Is accessed. This subsection discusses Interface to the 
expansion bus using lOSTRB ; MSTRB cycles are identical in timing to primary 
bus cycles, and are discussed in Section 13.2. 

Unlike thre primary bus, both read and write cycles on the I/O portion of the 
expansion bus are two HI cycles In duration and exhibit the same timing. Thr 
XR/W signal is high for reads and low for writes. Since I/O accesses take two 
cycles, many peripherals that require wait states if interfaced either to the pri¬ 
mary bus or using MSTRB may be used In a system without the need for wait 
states. Specifically, any devices with address access times greater than the 
35 ns required by the primary bus but not less than 46 ns can be Interfaced 
to the I/O bus without wait states. 

A/D converters are one common DSP system component which often falls 
into this category. These devices are available In many speed ranges and with 
a variety of features, and while some may require one or more wait states on 
the I/O bus, others may be used at full speed. 

One A/D converter that interfaces to the I/O bus without wait states and re¬ 
quires minimal additional logic is the ad 1332 from Analog Devices. Figure 
13-12 Illustrates an Interface to this device. 


13 


13-17 



Hardware Applications - Expansion Bus Interface 


74ALS138 



Figure 13-12. Expansion Bus Interface to A/D Converter 
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The interface uses a 74ALS138 to decode chip select for the converter. This 
configuration is shown assuming that other peripheral devices in the system 
also require chip select decodes. XA(8-10) are decoded to locate the con¬ 
verter at address 0804000h, which is the beginning of the I/O address space. 
Other peripherals may also use the outputs of the decoder, which generates 
chip selects In the I/O address space on 256 word boundaries. 

XAO Is used to drive the single address line required In interfacing to the con¬ 
verter. This Input selects between an Internal 32-word FIFO buffer and the 
A/D's control/status register. Thus, the FIFO is located at address 0804000h 
and the control/status register Is located at address 0804001 h. 
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Since the converter requires RD and WR control signals rather th an WE and 
OE, random logic is u sed to generate these signals from lOSTRB and XR/W. 
The converter's IRQ (Interrupt Request) output is used to alert the 
TMS320C30 to various conditions of converter status. 

Figure 13-13 shows the timing for read and write operations between the 
TMS320C30 and the ADI 332. Both operations are shown on the same tim¬ 
ing diagram since, unlike the primary bus, only data bus timing and the state 
of XR/W differ between the two different types of cycles. 
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Figure 13-13. Timing of Expansion Bus Interface 


In both cases, address and R/W are valid t1 = 10 ns after the falling edge of 
HI. After t2 = 17 ns, the propagation delay of the 74ALS138, the A/D con¬ 
verter's chip select goes lo w, selecting the device. Then, t3 = 10 ns after the 
rising edge of HI, lOSTRB goes low, and t4 = 5.8 ns following this, the RD or 
WR signal to the converter goes low. Initiating either a read or write cycle, re¬ 
spectively. 

For a read operation, the A/D converter provides data back to the TMS320C30 
t4 + t5 = 30.8 ns after RD goes low. This satisfies the TMS320C30's re¬ 
quirement of having data valid 35 ns after lOSTRB. For write operations, the 
A/D converter requires less than 5 ns of data setup and hold time with respect 
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to the rising edge of WR. This is met with a high degree of margin by the 
TMS320C30. 

It should be noted that for the AD1332's FIFO to be clocked properly, the Wd 
signal must go high between accesses to the device. Therefore, although the 
ADI 332 may be fast enough In some cases to be used at speeds approaching 
those of the primary bus, the STRB signal on the primary bus stays low for 
multiple consecutive read cycles. The I/O bus, therefore. Is the preferable 
choice for Interface to this device. 
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13.4 System Control Functions 

There are several aspects of TMS320C30 system hardware design which are 
critical to overall system operation. These include such functions as clock and 
reset signal generation and interrupt control. 

13.4.1 Clock Oscillator Circuitry 

An input clock may be provided to the TMS320C30 either from an external 
clock input or by using the on-board oscillator. Unless special clock require¬ 
ments exist using the on-board oscillator is generally a convenient method 
of clock generation. This method requires few external components and can 
provide stable, reliable clock generation for the device. 

Figure 13-14 shows a clock generator circuit using the internal oscillator. This 
circuit is designed to operate at 33.33 MHz and since crystals with funda¬ 
mental oscillation frequencies of 30 MHz and above are not readily available, 
a parallel-resonant third-overtone circuit is used. 


TMS320C30 

XI_ X2/CLKIN 

I 33.33 MHz I 


47 pF 




20 pF 




2.6 iiH 


V 


Figure 13-14. Crystal Oscillator Circuit 


In a third-overtone oscillator, the crystal fundamental frequency must be at¬ 
tenuated so that oscillation is at the third harmonic. This is achieved with an 
LC circuit that filters out the fundamental, thus allowing oscillation at the third 
harmonic. The impedance of the LC network must be inductive at the crystal 
fundamental and capacitive at the third harmonic. The Impedance of the LC 
circuit is given by: 


L 

C 



(3) 
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Therefore, the LC circuit has a pole at: 

1 

ujp = VIZ (4) 

At frequencies significantly lower than u)p, the 1 /(ujC) term in (3) becomes 
the dominating term, while wL cari be neglected. This gives: 

z(uj) =jujL for w << Up (5) 

In (5), the LC circuit appears inductive at frequencies lower than Wp. On the 
other hand, at frequencies much higher than ujp, the wL term is the dominant 
term in (3), and 1/(u)C) can be neglected. This gives: 

J_ 

z(u)) = jwC foruj>>ujp (6) 

The LC circuit in (6) appears increasingly capacitive as frequency increases 
above Wp. This is shown in Figure 13-15, which Is a plot of the magnitude 
of the Impedance of the LC circuit of Figure 13-14 versus frequency. 



Figure 13-15. Magnitude of the Impedance of the Oscillator LC 

Network 
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Based on the discussion above, the design of the LC circuit proceeds as fol¬ 
lows: 

1) Choose the pole frequency ojp approximately halfway between the crys¬ 
tal fundamental and the third harmonic. 

2) The circuit now appears Inductive at the fundamental frequency and 
capacitive at the third harmonic. 

In the oscillator of Figure 13-14, choose tUp = 22.2 MHz, which is approxi¬ 
mately halfway between the fundamental and the third harmonic. Choose C 
= 20 pF. Then, using (4), L = 2.6 pH. 

13.4.2 Reset Signal Generation 

The reset input controls initialization of internal TMS320C30 logic and also 
causes execution of the system initialization software. For proper system In¬ 
itialization, the reset signal must be applied at least ten HI cycles, i.e., 600 ns 
for a TMS320C30 operating at 33.33 MHz. Upon powerup, however, it can 
take 20 ms or more before the system oscillator reaches a stable operating 
state. Therefore, the powerup reset circuit should generate a low pulse on the 
reset line for 100 to 200 ms. Once a proper reset pulse has been applied, the 
processor fetches the reset vector from location zero which contains the ad¬ 
dress of the system initialization routine. Figure 13-16 shows a circuit which 
will generate an appropriate powerup reset signal. 


TMS320C25 
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The duration of the low pulse on the reset pin is approximately ti, which is the 
time it takes for the capacitor C-ito be charged to 1.5 V. This is approximately 
the voltage at which the reset input switches from a logic 0 to a logic 1. The 
capacitor voltage is given by: 


V 



(7) 


where t = R-| C-i is the reset circuit time constant. Solving (7) for t gives: 


V 

t = -R'iC'i 1n 1 - Vcc_ 


( 8 ) 


Setting the following: 


Rl = 1 MO 
Cl = 0.47 mF 
Vcc = 5 V 
V = Vi = 1.5 V 

gives t = 167 ms. Therefore, the reset circuit of Figure 13-16 provides a low 
pulse of long enough duration to ensure the stabilization of the system oscil¬ 
lator upon powerup. 

Note that If synchronization of multiple TMS320C30's Is required, all proces¬ 
sors should be provided with the same Input clock and the same reset signal. 
After powerup, when the clock has stabilized, all processors may then be 
synchonized by generating a falling edge on the common reset signal. Since 
it is in fact the falling edge of reset that establishes synchronization, reset must 
be high for a period of time (at least ten HI cycles) initially. Following the 
falling edge, reset should remain low for at least ten HI cycles and then be 
driven high. This sequencing of reset may be accomplished using additional 
circuitry, based on either RC time delays or counters. 
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13.5 XDS1000 Target Design Considerations 

The TMS320C30 Emulator is an Extended Development System (XDS1000), 
which has all the features necessary for full-speed emulation. The 
TMS320C30 uses a revolutionary technology to allow complete emulation via 
a serial scan path. If the user provides a 12 pin header on their target system, 
realtime emulation can be performed using the TMS320C30 device in their 
target system. Refer to Appendix B, Section B.1.4 for a complete description 
of theXDSIOOO. 

To use the emulation connector of the XDS1000, the signals shown In Figure 
13-18 should be provided to a 12 pin header (two rows of six pins) with pin 
8 cut out to provide keying. 


EMUIt 

1 

2 

GND 

EMUO^ 

3 

4 

GND 

EMU2t 

5 

6 

GND 

PD ( + 5 V) 

7 


NO PIN (KEY) 

EMU3 

9 

10 

GND 

H3 

11 

12 

GND 


HEADER DIMENSIONS: 

PIN TO PIN SPACING 0.100 IN. (X,Y) 
PIN WIDTH 0.025 IN. SQUARE POST 
PIN LENGTH 0.235 IN NOMINAL 


TOP VIEW 


I^These signals should always be pulled up with separate 20 kO resistors to +5 volts on the TMS320C30. 


Figure 13-18. 12 Pin Header Signals 


Signal Description: 

EMUO Emulation pin 0. 

EMU1 Emulation pin 1. 

EMU2 Emulation pin 2. 

EMUS Emulation pin 3. 

H3 TMS320C30 H3 

PD Presence detect. It indicates that the cable Is connected 

and target system is powered up. It 
should be tied to +5 volts In the target 
system. 


13-26 




Hardware applications - Target Design Considerations 


Figure 13-19 is a diagram of the typical setup when using the emulation 
connection of the XDS1000. 



Figure 13-19. Typical Setup For Using the Emulation Connection of the 

XDS1000 


For unbuffered signals, the distance between the TMS320C30 emulation pins 
(EMUO, EMU1, EMU2, EMU3, and H3) and the 12 pin header should be less 
than two inches. If the distance between the header and the TMS320C30 
emulation pins is more than two Inches but less than six Inches, the EMU3 
and H3 signals should be buffered. The buffer should be noninverting with a 
worst case propagation delay of 6.0 ns. For TMS320C30 emulation pins to 
12 pin header distances greater than six Inches, all emulation signals should 
be buffered. Recall that EMUO, EMU1, and EMU2 are inputs and EMU3 and 
H3 are outputs. The buffer should have the same characteristics as given 
above. 
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Appendix A 

TMS320C30 Timing Specifications & Dimensions 


This section provides timing specifications and dimensions for the 
TMS320C30 (third-generation TMS320) processor. In order to provide in¬ 
formation in advance of the complete data sheet, this section is included. 
Characterization data on the TMS320C30 is still being collected. A complete 
data sheet with additional Information will be available in the future. Please 
contact the local Tl field sales office to obtain these data sheets. 
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Table A- 


. Absolute Maximum Ratings Over Specified Temperature Range 


Condition/Characteristic 

Range 

Supply voltage range, Vqd 

-0.3 V to 7 V 

Input voltage range 

-0.3 V to 7 V 

Output voltage range 

-0.3 V to 7 V 

Continuous power dissipation 

2.0 W 

Operating free-air temperature range 

OX to 70X 

Storage temperature range 

-55Xto 150X 


Notes: 

1) Stresses beyond those listed under 'Absolute Maximum Ratings' may 
cause permanent damage to the device. This is a stress rating only and 
functional operation of the device at these or any other conditions be¬ 
yond those indicated In the 'Recommended Operating Conditions' sec¬ 
tion of this specification Is not Implied. Exposure to 
absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

2) All voltage values are with respect to Vss- 

Table A-2. Recommended Operating Conditions 


Operating Condition 

Min 

Nom Max 

Unit 

Vdd Supply voltages (DVDD, etc.) 

4.75 

5 5.25 

V 

Vss Supply voltages (CVSS, etc.) 

0 

V 

V|H High-level Input voltage 

2 

Vdd 0.3 

V 

V|L Low-level input voltage 

-0.3 

0.8 

V 

Iqh High-level output current 

300 

mA 

Iql Low-level output current 

2 

mA 

T Operating free-air temperature 

0 

70 

X 
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Table A-3. Electrical Characteristics Over Specified Free-Air Temperature 

Range 


Electrical Characteristic 

Min Nom Max 

Unit 

Vqh High-level output voltage (Vqd = Min,I qh " Max) 

2.4 3 

V 

Vql Low-level output voltage (Vdd = Min,I ql Max) 

0.3 0.6 

V 

\2 Three-state current (V[)q = Max) 

-20 20 

mA. 

Ij Input current (V| = Vss toV qq) 

-10 10 

mA 

Ice Supply current (Ta = 25“C,Vdd= Max, f x = Max) 

300 

mA 

C| Input capacitance 

15 

pF 

Cq Output capacitance 

15 

pF 


Notes: 

1) All typical values are at Vqq = 5 V, Ta = 25°C. 

2) fx is the input clock frequency. The maximum value Is 33.3 MHz. 

3) All input and output voltage levels are TTL compatible. 


2.15 V 


FROM OUTPUT 
UNDER TEST 



Figure A-1. Test Load Circuit 
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Figure A-3. HI/H3 Timing 




Appendix A - TMS320C30 Dimensions and Timing Specifications 


Table A-4. Switching Characteristics for CLKIISl, H1, and H3 



No. 

Name 

Description 

(1) 

tf(CI) 

CLKIN fall time 

(2) 

tw(CIL) 

CLKIN low pulse duration 
tc(CI) = 30 ns 

(3) 

tw(CIH) 

CLKIN high pulse duration 
tc(CI) = 30 ns 

(4) 

tr(CI) 

CLKIN rise time 

(5) 

tc(C!) 

CLKIN cycle time 

(6) 

tf(H) 

H1/H3 fall time 

(7) 

tw(HL) 

H1/H3 low pulse duration 

(8) 

tw(HH) 

H1/H3 high pulse duration 

(9) 

tr(H) 

H1 /H3 rise time 

(9.1) 

td(HL- HH) 

Delay from HI (H3) low to 
H3(H1) high 

(9.2) 

td(HH - HL) 

Delay from HI (H3) high to 
H3(H1) low 

(10) 

tc(H) 

H1/H3 cycle time 


Min Typ Max Unit 



Note: P = tc(CI) 

Table A-5. Switching Characteristics for a memory ((M)STRB = 0) read 


No. 

Name 

Description 

Min Typ Max 

Unit 

(11) 

td(H1L- (M)SL) 

HI low to (M)STRB low 

0 10 

ns 

(12) 

td(H1L - (M)SH) 

HI low to (M)STRB high 

0 10 

ns 

(13) 

td(H1H - (lO)RWL) 

H1 high to (IO)R/Wlow 

0 10 

ns 

(14) 

td(H1L- (IO)A) 

HI low to (IO)A valid 

0 10 

ns 

(15) 

tsu((IO)D)R 

(10)D valid before HI low 
(read) 

15 

ns 

(16) 

th((IO)D)R 

(10)D hold time after HI 

0 

ns 

(17) 

tsu((IO)RDY) 

(lO)RDY valid before HI 
high 

8 

ns 

(18) 

th((IO(RDY) 

(lO)RDY hold time after 

HI high 

0 

ns 


A-D 
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Figure A-4. Memory ((M)STRB = 0) Read 


Table A-6. Switching Characteristics for a memory ((M)STRB = 0) Write 


No. 

Name 

Description 

Min Typ Max 

Unit 

(19) 

tc|(H1H - (lO)RWH) 

H1 high to (IO)R/Whigh 

10 

ns 

(20) 

M(IO)D)W 

(I0)D valid after H1 low 
(write) 

20 

ns 

(21) 

th((IO)D)W 

(I0)D hold time after H1 
high (write) 

0 

ns 


A-D 
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Figure A-5. Memory ((M)STRB = 0) Write 
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Table A-7. Switching Characteristics for a Memory (lOSTRB = 0) Read 


No. 

Name 

Description 

Min Typ Max 

Unit 

(11.1) 

td(H1H - lOSL) 

HI high to lOSTRB low 

0 10 

ns 

(12.1) 

td(H1H - lOSH) 

HI high to lOSTRB high 

0 10 

ns 

(22) 

td(H1L - lORWH) 

HI low to lOR/W high 

0 10 

ns 

(14.1) 

td(H1L - lOA) 

HI low to lOA valid 

0 10 

ns 

(15.1) 

tsu(IOD)R 

lOD valid before HI high 
(read) 

15 

ns 

(16.1) 

th(IOD)R 

10D hold time after HI 
high (read) 

0 

ns 

(17.1) 

tsu(IORDY) 

lORDY valid before HI high 

8 

ns 

(18.1) 

th(IORDY) 

lORDY hold time after HI 
high 

0 

ns 


Table A-8. Switching Characteristics for a Memory (lOSTRB = 0) Write 


No. 

Name 

Description 

Min Typ Max 

Unit 

(23) 

td(H1L - lORWL) 

HI low to (IO)R/Wlow 

0 10 

ns 

(20.1) 

tv(IOD)W 

10D valid before HI low 
(write) 

15 

ns 

(21.1) 

tv(IOD)W 

lOD hold time after HI low 
(write) 

0 

ns 
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Figure A-8. Timing for XFO and XF1 When Executing a LDFI or LDII 
Table A-9. Information for Figure A-8 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

td(H3H-XF0L) 

H3 high to XFO low 

10 

ns 

(2) 

tsu(XF1) 

XF1 valid before HI low 

8 

ns 

(3) 

th(XFI) 

XF1 hold time after HI low 

0 

ns 
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XFO PIN 


Figure A-9. Timing for XFO When Executing a STFI or STII 


Table A-10. Information for Figure A-9 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

td(H3H-XF0H) 

H3 high to XFO high 

10 

ns 


A-D 


A-12 
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Figure A-10. Timing for XFO and XF1 When Executing SIGI 


Table A-11. Information for Figure A-10 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

tc|(H3H-XF0L) 

H3 high to XFO low 

10 

ns 

(2) 

tci(H3H-XF0H) 

H3 high to XFO high 

10 

ns 

(3) 

tsu(XF1) 

XF1 valid before H1 low 

8 

ns 

(4) 

th(XF1) 

XF1 hold time after H1 low 

0 

ns 


A-D 
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FETCH LOAD 

I INSTRUCTION | DECODE | READ | EXECUTE 1 



Figure A-11. Timing for Loading XF Register When Configured as an Output 

Pin 

Table A-12. Information for Figure A-11 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

tv(H3H-XF) 

H3 high to XF valid 

10 

ns 


A-D 


A^14 
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Figure A-12. Change of XF From Output to Input Mode 


Table A-13. Information for Figure A-12 


IMo. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

tci(H3H-XF0l) 

H3 high to XF switching 
from output to input 

15 

ns 

(2) 

tsu(XF) 

XF setup before H1 low 

10 

ns 

(3) 

th(XF) 

XF hold before H1 low 

0 

ns 


A-D 


A-15 
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EXECUTION OF 
LOAD OF lOF I 



Figure A-13. Change of XF From Input to Output Mode 


Table A-14. Information for Figure A-13 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

td(H3H-XFI0) 

H3 high to XF switching 
from input to output 

10 

ns 


A-D 


A-16 
















A-17 



(NOTE 5) 
HI 


H3 


(X)D 
(NOTE 1) 

(X)A 
(NOTE 2 ) 

CONTROL 
SIGNALS (NOTE 3) 

iACK 

ASYNCHRONOUS 
RESET SIGNALS 
(NOTE 4) 



NOTES: 1. (X)D includes D(31-0) and XD(31-0). 

2. X(A) includes A(23-0) and X A(12-0 ).__ 

3. Control signals include R/W, STRB, XR/W, MSTRB, and lOSTRB. 

4. Asynchronously reset signals include XF1, XFO, CLKXO, DXO, FSXO, CLKRO, DRO, FSRO, CLKX1, DX1, FSX1, CLKR1, DR1, 
FSRI^TCLKO, and TCLK1. 

5. RESET is an asynchronous input. 


> 

I 

a 


Figure A-14. RESET Timing 
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Table A-15. Information for Figure A-14 


No. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

tsu(RESET) 

Setup for RESET 
before CLKIN low 

10 

ns 

(2) 

td(CLKINH-HIH) 

CLKIN high to H1 high 

0 15 

ns 

(3) 

tsu(RESETH-HIL) 

Setup for RESET high 
before HI low and after 10 

H1 clock cycles 

10 

ns 

(5) 

td(CLKINH-H3L) 

CLKIN high to H3 low 

0 15 

ns 

(8) 

tdjs(HIH-XD) 

HI high to (X)D three state 

15 

ns 

(9) 

tdis(H3H-XA) 

H3 high to (X)A three state 

10 

ns 

(10) 

td(H3H- 

CONTROLH) 

H3 high to control signals 
high 

10 

ns 

(11) 

td(HIH-IACKH) 

HI high to lACK high 

10 

ns 

(12) 

tdis(RESETL- 

ASYNCH) 

RESET low to 

asynchronously reset signals 
three state 

15 

ns 
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RESET OR FETCH FIRST 



Figure A-15. RESET and INT(3-0) Response Timing 


Table A-16. Information for Figure A-15 


IMo. 

Name 

Description 

Min Typ Max 

Unit 

(1) 

tsu(INT) 

INT(3-0) setup before HI low 

10 

ns 

(2) 

tw(INT)/Vote 1 

Interrupt pulse width to 
guarantee one interrupt seen 


ns 


Note 1: Interrupt pulse width must be at least 1 P wide to guarantee it will be seen. It must be less than 
2 P wide to guarantee it will be responded to only once. The recommended pulse width is 1.5 P. 


A-D 


A-19 





















Appendix A - TMS320C30 Dimensions and Timing Specifications 



Figure A-16. BACK Timing 


Table A-17. Information for Figure A-16 


No. 

Name 

Description 

IN/lin Typ Max 

Unit 

(1) 

td(HIH-IACKL) 

HI high to TaEk low 

10 

ns 

(2) 

tci(HIH-IACKH) 

HI high to lACK high during 
first cycle of lACK instruction 
data read 

10 

ns 


A-D 


A-20 
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-i-j ^^(11) 


! J 




NOTE: Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 


Figure A-18. Fixed Data Rate Mode 







NOTES: 1. Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 

2. Timings not expressly specified for variable data rate mode are the same as those for fixed data rate mode 


Figure A-19. Variable Data Rate Mode 
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Table A-18. Serial Port Timing as Shown in Figures A-18 and A-19 


No. 

Name 

Description 

Max Min 

Unit 

(1) 

td(H1-SCK) 

HI high to internal CLKX/R. 

15 

ns 

(2) 

tc(SCK) 

CLKX/R cycle 

time. 

CLKX/R external 

tc(H)x2.6 

ns 

CLKX/R internal 

tc(H)x234 tc(H)x2 

(3) 

tw(SCK) 

CLKX/R high/ 

low pulsewidth 

CLKX/R external 

tc(H)+5 

ns 

CLKX/R internal 

tc(SCK)/2 [tc(SCK)/2]-15 

(4) 

tr(SCK) 

CLKX/R rise time. 

8 

ns 

(5) 

tf(SCK) 

CLKX/R fall time 

8 

ns 

(6) 

td(DX) 

CLKX to DX 

valid. 

CLKX external 

35 

ns 

CLKX internal 

20 

(7) 

tsu(DR) 

DR setup before 

CLKR. 

CLKR external 

10 

ns 

CLKR internal 

25 

(8) 

th(DR) 

DR hold from 

CLKR. 

CLKR external 

10 

ns 

CLKR internal 

-5 

(9) 

td(FSX) 

CLKX to internal 

FSX. 

CLKX external 

32 

ns 

CLKX internal 

17 

(10) 

tsu(FSR) 

FSR setup 

before CLKR. 

CLKR external 

10 

ns 

CLKR internal 

10 

(11) 

th(FS) 

FSX/R input 

hold from 
CLKX/R. 

CLKX/R external 

10 

ns 

CLKX/R internal 

-5 

(12) 

tsu(FSX) 

External FSX 

setup before 
CLKX. 

CLKX external 

[tc(CLKX)/2]-10 -[tc(H)-8] 

ns 

CLKX internal 

tc(CLKX)/2 -[tc(H)-21] 

(13) 

td(CH-DX)V 

CLKX to first 

DX bit, FSX 
precedes CLKX. 

CLKX external 

36 

ns 

CLKX internal 

21 

(14) 

td(FSX-DX)V 

FSX to first DX bit, CLKX precedes 
FSX. 

36 

ns 


A-D 


A-24 
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Figure A-20. HOLD/HOLDA Timing 



Table A-19. Information for Figure A-20 


Min Max 


15 


Name 

Description 

tsu(HOLD) 

HOLD valid before H1 low 

tv(HOLD) 

HOLD valid after H1 low 

tw(HOLD) 

HOLD low width 


tw(HOLDA) 


td(H1L-SH)H 


tdis(HIL-S) 


tdis(HIL-RW) 


ten(HIL-RW) 


tdis(HIL-A) 


tdis(HIH-D) 


HOLDA low width 


HI low to STRB high for a 
HOLD 


H1 low to STRB three state 


HI low to STRB active 


HI low to R/W three state 


HI low to R/W active 


H1 low to address three state 


H1 low to address valid 


HI high to data three state 



A-25 
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Appendix B 

Development Support/Part Order Information 


This section provides development support information, device part numbers, 
and support tool ordering information for the TMS320C30 (third-generation 
TMS320) processor. Figure B-1 shows the software and hardware develop¬ 
ment tools available and the development environment for the TMS320C30. 
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Figure B-1. TMS320C30 Development Environment 
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Extensive documentation, including data sheets, user's guides, and application 
reports, is available to support DSP design. A series of DSP textbooks has 
been published both by Prentice-Hall and John Wiley and Sons to support 
research and education. Other support includes a technical support hotline 
(713-274-2320) and a bulletin board service (713-274-2323). Tl's Regional 
Technology Centers (RTCs) provide hands-on workshops and design ser¬ 
vices. 

Many third-parties and consultants with DSP expertise can assist in various 
application areas. TMS320C30 Algorithm Development Packages will be 
available from multiple third-parties and consultants in the near future. Sub¬ 
scribe to the DSP newsletter "Details on Signal Processing" for up to date 
information on new products and services from third-parties and consultants. 
Call Tl's Customer Response Center at (800) 232-3200 to subscribe to the 
newsletter. Contact the nearest Tl field sales office for support tool availability 
or further details (see list of sales offices and distributors at end of book). 

The major topics discussed in this section are listed below. 

• TMS320C30 Development Support (Section B.1 on page B-4) 

- Macro Assembler/Linker 

- C Compiler 

- Simulator 

“ Extended Development System (XDS1000) 

- TMS320 DSP Hotline/Bulletin Board Service 

• TMS320C30 Part Order Information (Section B.2 on page B-12) 

“ Device part numbers 

“ Software and hardware support tools part numbers 

- Device and support tool prefix designators 

~ Device nomenclature 


A-D 
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B.1 TMS320C30 Development Support 

Texas Instruments offers extensive development support and complete doc¬ 
umentation with the TMS320C30 (third-generation) digital signal processor. 
Tools are provided for the TMS320C30 to evaluate the performance of the 
processor, develop algorithm Implementations, and fully integrate the design's 
software and hardware modules. Development operations are performed with 
the TMS320C30 Macro Assembler/Linker, C Compiler, Simulator, and Emu¬ 
lator (Extended Development System - XDS1000). 

A description and key features for each TMS320C30 development support 
tool is provided in the following subsections. For ordering information, see 
Section B.2. 


B.1.1 Macro Assembler/Linker 

The TMS320C30 Macro Assembler/Linker is a software tool that converts 
source mnemonics to executable object code. 

The following key features distinguish the TMS320C30 Macro 
Assembler/Linker: 

• Macro capabilities and library functions 

• Conditional assembly 

• Relocatable modules 

• Complete error diagnostics 

• Symbol table and cross reference 

The TMS320C30 Macro Assembler/Linker is shipped with four programs to 
address specific needs. They are: 

1) The assembler 

2) The archiver 

3) The linker 

4) The object format converter 

These programs and their functionality are described In the following para¬ 
graphs. 


A-D 


• The assembler translates assembly language source files Into machine 
language object files. Source files can contain instructions, assembler 
directives, and macro directives. Assembler directives can be used to 
control various aspects of the assembly process, such as the source list¬ 
ing format, data alignment, and section content. 

• The archiver allows collection of a group of files Into a single archive 
file. For example, several macros can be collected together into a macro 
library. The assembler will search through the library and use the mem¬ 
bers that are called as macros by the source file. It is also possible to 
use the archiver to collect a group of object files Into an object library. 
The linker will include the members in the library that resolve external 
references during the link. 


B-4 




Appendix B - TMS320C30 Development Support 


• The linker combines object files into a single executable object module. 
As it creates the executable module, it performs relocation and resolves 
external references. The linker accepts relocatable object files (created 
by the assembler) as input. It also accepts archive library members and 
output modules created by a previous linker run. Linker directives allow 
combining of file sections, binding of sections or symbols to addresses, 
and defining of global symbols. 

• The main purpose of this development process is to produce a module 
that can be executed in a system that contains a TMS320C30 device 
or the software or hardware development tools. (Note that only 
linked files can be executed). 

• Most EPROM programmers do not accept assembler/linker files as input. 
The object format converter converts the object file into Intel, Tek¬ 
tronix, or Tl-tagged object format. The converted file can be down¬ 
loaded to an EPROM programmer. This EPROM code can then be 
executed on the TMS320C30 device. 

Refer to Figure B-1 for a diagram of the development environment when using 
the Assembler/Linker. 

The macro assembler/linker is currently available for PC/MS-DOS, VAX VMS, 
SUN-3 UNIX , and VAX ULTRIX operating systems. 

B.1.2 C Compiler 

The optimizing C compiler is a full implementation of the standard Kernighan 
and Ritchie C. The compiler accepts a digital signal processing program 
written in C language. It outputs TMS320C30 assembly language source 
code which is then processed by the assembler where the TMS320C30 mne¬ 
monics are converted to object code. 

This high-level language compiler allows time-critical routines written In as¬ 
sembly language to be called from within the C program. The converse is also 
available; assembly routines may call C functions. The output of the compiler 
can be edited prior to assembly/link to further optimize the performance of the 
code. The compiler supports the insertion of assembly language code into C 
source code. The result is a compiler that allows the relative amounts of 
high-level programming and assembly language code to be tailored according 
to the application. Refer back to Figure B-1 for a diagram of the development 
environment when using the C compiler. 

The compiler is currently available for PC/MS-DOS, VAX VMS, SUN-3 UNIX, 
and VAX ULTRIX operating systems. The assembler/linker is Included with the 
shipment of the TMS320C30 C compiler. The output of this assembler/linker 
can be downloaded and used with the simulator, XDS, or PROM programmer. 
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B.1.3 Simulator 

The TMS320C30 Simulator is a software program that simulates operation of 
the TMS320C30. 

The following features highlight simulator capability for effective TMS320C30 
software development: 

• Simulates the entire TMS320C30 digital signal processor instruction set 

• Simulates the key TMS320C30 peripheral features (DMA, timers, and 
serial port) 

• Command entry from either menu-driven keystrokes (menu mode) or 
from a batch file (line mode) 

• Help menus for all screen modes 

• Standard interface can be user customized 

• Simulation parameters quickly stored/retrieved from files to facilitate 
preparation for individual sessions 

• Reverse assembly allows editing and re-assembly of source statements 

• Memory can be displayed (at same time) as: 

- hexadecimal 32-bit values 
assembled source 

• Execution modes include: 

- single/multiple Instruction count 

- single/multiple cycle count 

- until condition is met 

- while condition exists 

~ for set loop count 

- unrestricted run with halt by key input 

• Easy to define trace expressions 

• Trace execution with display choices of: 

- designated expression values 

- cache registers 

- instruction pipeline for easy optimization of code 

• Breakpoint conditions include: : 

- address read 

- address write 

- address read or write 

address execute 

- expression valid 

• Simulates cache utilization 

• Cycle counting 

- display the number of clock cycles in single step or run mode 

- external memory can be configured with wait states for accurate 

cycle counting 

The simulator allows verification and monitoring of the state of the processor. 
Simulation speed Is on the order of thousands of instructions per 
second (VAX/VMS, VAX/ULTRIX, and SUN-3 UNIX) or hundreds of in¬ 
structions per second (PC/MS-DOS). 
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The simulators use TMS320C30 object code, produced by the Macro 
Assembler/Linker. Input and output files may be associated with the port 
addresses of the I/O Instructions in order to simulate I/O devices connected 
to the processor. Before Initiating program execution, breakpoints may be set, 
and the trace format defined. 

During program execution, the internal registers and memory of the simulated 
TMS320C30 are modified as each instruction Is interpreted by the host com¬ 
puter. Execution is suspended when one of the following conditions exists: 

1) A breakpoint or error is encountered. 

2) Execution is halted. 

Once program execution is suspended, the Internal registers and both program 
and data memories can be inspected and/or modified. The trace memory can 
also be displayed. A record of the simulation session can be maintained In a 
journal file, so that it can be re-executed to regain the same machine state 
during another simulation session. 

The user interface in the simulator is identical to that in the XDS. See Figure 
B-2 for an example of the user Interface. 
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Expressions (OE commands) 
Files (DF command) 

Memory (DM command) 
Symbols (OS command) 
(And other displays.) 


Figure B-2. TMS320C30 Simulator User Interface 


The simulator is currently available from Tl for PC/MS-DOS, VAX VMS, and 
VAX ULTRIX operating systems. A SUN-3 UNIX version of the simulator can 
be purchased from a third party: Spectron Microsystems Inc. This version is 
the same as Tl's simulator for the PC/MS-DOS, VAX VMS, and VAX ULTRIX. 
Contact Spectron at (805 967-0503) for more information. 
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B.1.4 TMS320C30 Emulator - Extended Development System (XDS1000) 

The TMS320C30 Emulator (XDS1000) is a user-friendly system that has all 
the features necessary for full-speed emulation to debug hardware, software, 
or integrate the software with the hardware. Some of the XDS1 OOO's features 
include; 

• Full-speed execution and monitoring out of the customers target system 
via a 12 pin target connector 

• Software breakpoint 

• Software trace 

• Software timing capabilities 

• Single-step execution 

• Inspect/modify registers and program/data memory 

• Upload/download capabilities to/from data/program memory 

• Windowed user interface similar to the TMS320C30 simulator 

Full-speed execution and monitoring of the customers target system via a 12 
pin target connector has the advantage of using a serial scan path to give ac¬ 
cess to the internal registers as well as internal and external memory of the 
device. Since execution is out of the TMS320C30 located in the target sys¬ 
tem, there is no timing difference during emulation. 

Software breakpoints means the program can be stopped on a specific ad¬ 
dress. When the program counter reaches the designated breakpoint address, 
the emulator will halt execution and allow the user to observe the status of the 
TMS320C30 (i.e.. Inspect memory or registers). Software trace allows view¬ 
ing of the TMS320C30's state when a breakpoint is reached. This information 
can be saved to a file for future analysis. Software timing permits keeping 
track of clock ticks between breakpoints or while program single stepping. 

The XDS1000 consists of two full-size PC-XT/AT cards. One card is the 
TMS320C30 XDS1000 Controller Card, the other is the TMS320C30 
XDS1000 Development Board. 

The TMS320C30 XDS1000 Controller Card Is responsible for interpreting 
commands sent from the PC and converting those commands into appropriate 
signal sequences to control the TMS320C30 in the user's target system. 

The TMS320C30 XDS1000 Development Board Is a predefined target system 
that contains: 

• A TMS320C30 device 

• 16K X 32-bits full-speed (zero wait state) SRAM on the primary bus 

• Two selectable banks of 8K x 32-bits full-speed (zero wait state) SRAM 
on the expansion bus 

See figure Figure B-3 for a visual representation of the TMS320C30 
XDS1 OOO's development environment. 
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Figure B-3. T!VIS320C30 XDS1000 Development Environment 


A-D 


This figure shows the two environments in which the TMS320C30 XDS1000 
can operate: 

1) The hardware/software development configures the TMS320C30 
XDS1000 and the user's target system in the emulator mode. Section 
13.5 of this document shows the 12-pin header or emulator connector 
necessary for the user's target system to work with the TMS320C30 
XDS1000. 

2) The algorithm development environment allows the user to debug his 
software before the user's target system is built. In this configuration, 
the TMS320C30 XDS1000 Development Board can be used in place of 
the user's target system. In this mode, code can be downloaded Into the 
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memory on the TI\/IS320C30 XDS1000 Development Board and execute 
at full speed. 

To use the TMS320C30 XDS1000, the following equipment is required: 

• IBM PC-XT/AT compatible 

• Two and one-half eight-bit slots for the PC-AT, three full-size eight-bit 
slots for the PC-XT 

• A minimum of 640K bytes of memory in the PC 

• PC/MS DOS rev 2.0 or later 

In summary, the TMS320C30 XDS1000 Is a full-speed emulator that comes 
with a pre built target system for early design development. The TMS320C30 
XDS1000 can help debug hardware In realtime, debug software in realtime, 
and integrate the hardware and software together. 

B.1.5 TMS320 DSP Hotline/Bulletin Board Service 

The TMS320 group at Texas Instruments provides a DSP Hotline to answer 
TMS320 technical questions such as device problems, development tools, 
documentation, upgrades, and new TMS320 products. The hotline is open 
five days a week from 8:00 AM to 6:00 PM Central Time. The phone number 
is (713) 274-2320. For pricing and availability of TMS320 devices and de¬ 
velopment tools, contact the nearest Tl sales office. To order literature, call the 
Customer Response Center (CRC) at (800) 232-3200. 

The TMS320 DSP Bulletin Board Service is a telephone-line computer bulletin 
board that provides access to information pertaining to TMS320 devices. 
Specification updates for current or new TMS320 devices and development 
tools are communicated via the bulletin board as the information becomes 
available. The Bulletin Board Service can be accessed by dialing (713) 
274-2323 with a 300, 1200, or 2400-bps modem. 

The bulletin board contains TMS320C30 source code from Section 12 of the 
TMS320C30 Users Guide as well as development tool and silicon revisions 
and enhancements. The bulletin board also provides new DSP application 
software as it becomes available. See the TMS320 Family Development Sup¬ 
port Reference Guide for further information on how to access the bulletin 
board. 
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B.2 TMS320C30 Part Order information 

This section provides the device and support tool part numbers. Table B-1 
lists the part numbers for the TMS320C30, and Table B-2 gives ordering in¬ 
formation for TMS320C30 hardware and software support tools. A discussion 
of the TMS320 family device and development support tool prefix designators 
is included to assist in understanding the TMS320 product numbering system. 

Table B>1. TMS320C30 Digital Signal Processor Part Numbers 


DEVICE 

TECHNOLOGY 

OPERATING 

FREQUENCY 

PACKAGE 

TYPE 

TYPICAL 

DISSIPATION 

tTMX320C30GBH 

I.O-pm CMOS 

33 MHz 

Ceramic 180-pin PGA 

1.5 W 


tMilitary version planned; contact nearest sales office for availability. 


Table B-2. TMS320C30 Support Tool Part Numbers 


TOOL DESCRIPTION 

OPERATING SYSTEM 

PART NUMBER 

SOFTWARE 

Macro Assembler/Linker 

VAX VMS 

PC/MS-DOS 

SUN-3 UNIX " 

VAX ULTRIX 

TMDX3243250-08 

TMDX3243850-02 

TMDX3243550-08 

TMDX3243260-08 

C Compiler & Macro Assembler/ 
Linker 

VAX VMS 

PC/MS DOS 

SUN-3 UNIX * 

VAX ULTRIX 

TMDX3243255-08 

TMDX3243855-02 

TMDX3243555-08 

TMDX3243265-08 

Simulator 

VAX VMS 

PC/MS-DOS 

SUN-3 UNIX * 

VAX ULTRIX 

TMDX3243251-08 
TMDX3243851-02 
Offered by 

Spectron Inc. 

(805) 967-0503 
TMDX3243261-08 

HARDWARE 

XDS1000 

PC/MS-DOS 

TMDX3261030 


* Please note SUN UNIX support for TMS320C30 software tools is for the 68000 family 
based SUN-3 series workstations. These tools are NOT SUPPORTED on the SUN-4 
series machines that use the SPARC processor, or the SUN-386i series of workstations. 
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Appendix B - TMS320C30 Part Order Information 


B.2.1 Device and Development Support Tool Prefix Designators 

To assist the user in understanding the stages in the product development 
cycle, Texas Instruments assigns prefix designators in the part number no¬ 
menclature. A device prefix designator has three options: TMX, TMP, and 
TMS, and a development support tool prefix designator has two options: 
TMDX and TMDS. These prefixes are representative of the evolutionary stages 
of product development from engineering prototypes (TMX/TMDX) through 
fully qualified production devices (TMS/TMDS). This development flow is 
defined below. 

Device Development Evolutionary Flow: 

TMX Experimental device that is not necessarily representative of the final 
device's electrical specifications. 

TMP Final silicon die that conforms to the device's electrical specifications 
but has not completed quality and reliability verification. 

TMS Fully qualified production device. 

Support Tool Development Evolutionary Flow: 

TMDX Development support product that has not yet completed Texas In¬ 
struments internal qualification testing. 

TMDS Fully qualified development support product. 


TMX and TMP devices and TMDX development support tools are shipped 
with the following disclaimer: 

"Developmental product is intended for internal evaluation purposes." 


Note: 

Texas Instruments recommends that prototype devices (TMX or TMP) not 
be used In production systems since their expected end-use failure rate is 
undefined but predicted to be greater than standard qualified production 
devices. 


TMS devices and TMDS development support tools have been fully charac¬ 
terized and the quality and reliability of the device has been fully demon¬ 
strated. Texas Instruments standard warranty applies. 
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B.2.2 Device Nomenclature 


In addition to the prefix, the device nomenclature includes a suffix that follows 
the device family name. This suffix Indicates the package type (e.g., N, FN, 
or GB) and temperature range (e.g., L). Figure B-4 provides a legend for 
reading the complete device name for any TMS320 family member. 


TMX 320 


PREFIX- 

TMX = experimental device 
TMP = prototype device 
TMS = qualified device 
SMJ = MIL-STD-883C 


DEVICE FAMILY — 

320 = TMS320 family 


TECHNOLOGY- 

C = CMOS 
E = CMOS EPROM 
No letter = NMOS 


C 30 GB H 

L TEMPERATURE RANGE 

H = 0 to BOX 
L = 0 to 70X 
S = -55 to 100X 
M = -55 to 125X 
A = -40 to 85X 

— PACKAGE TYPE 

N = plastic DIP 
JD = ceramic DIP 
side-brazed 

FN = plastic leaded CC 
GB = ceramic PGA 
FJ = ceramic leaded CC 
FD = leadless ceramic CC 


DEVICE- 

1st-gen. DSP: 
10 
15 
17 

2nd-gen. DSP: 
20 
25 


3rd-gen. DSP: 
30 


Figure B-4. TMS320 Device Nomenclature 
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Appendix C 

Instruction Opcodes 


The opcode fields for all the TMS320C30 instructions are shown in Table C-1. 
Bits in the table marked with a hyphen are defined in the individual instruction 
description (see Section 11). Table C-1 along with the Instruction de¬ 
scriptions fully define the instruction words. The opcodes are listed in nu¬ 
merical order. 


Table C-1. TMS320C30 Instruction Opcodes 


INSTRUCTION 

31 

30 

29 

28 

27 

26 

25 

24 

23 

ABSF 

0 

0 

0 

0 

0 

0 

0 

0 

0 

ABSI 

0 

0 

0 

0 

0 

0 

0 

0 

1 

ADDC 

0 

0 

0 

0 

0 

0 

0 

1 

0 

ADDF 

0 

0 

0 

0 

0 

0 

0 

1 

1 

ADD! 

0 

0 

0 

0 

0 

0 

1 

0 

0 

AND 

0 

0 

0 

0 

0 

0 

1 

0 

1 

ANDN 

0 

0 

0 

0 

0 

0 

1 

1 

0 

ASH 

0 

0 

0 

0 

0 

0 

1 

1 

1 

CMPF 

0 

0 

0 

0 

0 

1 

0 

0 

0 

CMPI 

0 

0 

0 

0 

0 

1 

0 

0 

1 

FIX 

0 

0 

0 

0 

0 

1 

0 

1 

0 

FLOAT 

0 

0 

0 

0 

0 

1 

0 

1 

1 

IDLE 

0 

0 

0 

0 

0 

1 

1 

0 

0 

LDE 

0 

0 

0 

0 

0 

1 

1 

0 

1 

LDF 

0 

0 

0 

0 

0 

1 

1 

1 

0 

LDFI 

0 

0 

0 

0 

0 

1 

1 

1 

1 

LDI 

0 

0 

0 

0 

1 

0 

0 

0 

0 

LDII 

0 

0 

0 

0 

1 

0 

0 

0 

1 

LDM 

0 

0 

0 

0 

1 

0 

0 

1 

0 

LSH 

0 

0 

0 

0 

1 

0 

0 

1 

1 

MPYF 

0 

0 

0 

0 

1 

0 

1 

0 

0 

MPYI 

0 

0 

0 

0 

1 

0 

1 

0 

1 

NEGB 

0 

0 

0 

0 

1 

0 

1 

1 

0 

NEGF 

0 

0 

0 

0 

1 

0 

1 

1 

1 

NEGI 

0 

0 

0 

0 

1 

1 

0 

0 

0 

NOP 

0 

1° 

0 

0 

1 

1 

0 

0 

1 
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Table C-1. TMS320C30 Instruction Opcodes (Continued) 

















































Appendix C - Instruction Opcodes 



anEiH 

aaaB 
avans 
ai 
Bl 


Table C-1. TMS320C30 Instruction Opcodes (Continued) 


INSTRUCTION 


MPYF3 


MPYI3 


0R3 


SUBB3 


SUBF3 


SUBI3 


TSTB3 


X0R3 


LDfcond 


LD\ con d 


BR(D)t 


CALL 


RPTB 


SWI 


BcondiD)t 


DBcond{D)t 


CALLcond 


THAPcond 


RETlcond 


RETScond 


Epn 

E 




MPYF31ISUBF3 1 0 0 0 0 

1 0 0 0 0 

1 0 0 0 0 

1 0 0 0 0 


MPYI3I1ADDI3 1 0 0 

1 0 0 

1 0 0 

1 0 


MPYI3I1SUBI3 1 0 0 0 1 

1 0 0 0 1 

1 0 0 0 1 

1 0 0 0 1 


0 

0 

0 

1 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

1 



0 

0 

0 

1 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

1 


STFIISTF 


STlllSTI 


LDFIILDF 


t Opcode same for standard and delayed instructions, 
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Appendix C - Instruction Opcodes 


Table C>1. TMS320C30 Instruction Opcodes (Concluded) 











































Appendix D 

Quality and Reliability 


The quality and reliability performance of Texas Instruments Microprocessor 
and Microcontroller Products, which includes the three generations of 
TMS320 digital signal processors, relies on feedback from: 

• Our customers 

• Our total manufacturing operation from front-end wafer fabrication to 
final shipping inspection 

• Product quality and reliability monitoring. 

Our customer's perception of quality must be the governing criterion for 
judging performance. This concept Is the basis for Texas Instruments Corpo¬ 
rate Quality Policy, which is as follows: 

"For every product or service we offer, we shall define the require¬ 
ments that solve the customer's problems, and we shall conform to 
those requirements without exception." 

Texas Instruments offers a leadership reliability qualification system, based on 
years of experience with leading-edge memory technology as well as years of 
research Into customer requirements. Quality and reliability programs at Tl are 
therefore based on customer input and Internal information to achieve con¬ 
stant Improvement in quality and reliability. 
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Reliability Stress Tests 


D.1 Reliability Stress Tests 

Accelerated stress tests are performed on new semiconductor products and 
process changes to ensure product reliability excellence. The typical test en¬ 
vironments used to qualify new products or major changes In processing are: 

• High-temperature operating life 

• Storage life 

• Temperature cycling 

• Biased humidity 

• Autoclave 

• Electrostatic discharge 

• Package integrity 

• Electromigration 

• Channel-hot electrons (performed on geometries less than 2.0 pm). 

Typical events or changes that require Internal requalification of product in¬ 
clude: 

• New die design, shrink, or layout 

• Wafer process (baseline/control systems, flow, mask, chemicals, gases, 
dopants, passivation, or metal systems) 

• Packaging assembly (baseline control systems or critical assembly 
equipment) 

• Piece parts (such as lead frame, mold compound, mount material, bond 

wire, or lead finish) 

• Manufacturing site. 

Tl reliability control systems extend beyond qualification. Total reliability 
controls and management Include product reliabily monitor as well as final 
product release controls. MOS memories, utilizing high-density active ele¬ 
ments, serve as the leading indicator In wafer-process integrity at Tl MOS fa¬ 
brication sites, enhancing all MOS logic device yields and reliability. Tl places 
more than several thousand MOS devices per month on reliability test to en¬ 
sure and sustain built-in product excellence. 

Table D-1 lists the microprocessor and microcontroller reliability tests, the 
duration of the test, and sample size. The following defines and describes 
those tests in the table. 

AOQ (Average Outgoing Quality) Amount of defective product in a pop¬ 
ulation, usually expressed In terms of 
parts per million (PPM). 

Estimated field failure rate in number 
of failures per billion power-on device 
hours; 1000 FITS equals 0.1 percent 
fail per 1000 device hours. 

Device dynamically exercised at a high 
ambient temperature (usually 125X) 
to simulate field usuage that would 


FIT (Failure In Time) 


Operating lifetest 
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High-temperature storage 


Biased humidity 


Autoclave (pressure cooker) 


Temperature cycle 


Thermal shock 


FIND 


Mechanical Sequence: 

Fine and gross leak 
Mechanical shock 

FIND (optional) 

Vibration, variable frequency 

Constant acceleration 

Fine and gross leak 


expose the device to a much lower 
ambient temperature (such as 55X). 
Using a derived high temperature, a 
55X ambient failure rate can be cal¬ 
culated. 

Device exposed to 150X unbiased 
condition. Bond Integrity Is stressed in 
this environment. 

Moisture and bias used to accelerate 
corrosion-type failures in plastic 
packages. Conditions Include 85X 
ambient temperature with 85-percent 
relative humidity (RH). Typical bias 
voltage Is +5 V and ground on alter¬ 
nating pins. 

Plastic-packaged devices exposed to 
moisture at 121X using a pressure of 
one atmosphere above normal pres¬ 
sure. The pressure forces moisture 
permeation of the package and accel¬ 
erates corrosion mechanisms (if pres¬ 
ent) on the device. External package 
contaminates can also be activated 
and caused to generate inter-pin cur¬ 
rent leakage paths. 

Device exposed to severe temperature 
extremes in an alternating fashion 
(-65X for 15 minutes and 150X for 
15 minutes per cycle) for at least 1000 
cycles. Package strength, bond qual¬ 
ity, and consistency of assembly pro¬ 
cess are stressed in this environment. 

Test similar to the temperature cycle 
test, but involving a liquid-to-liquid 
transfer, per MIL-STD-883C, Method 
1011. 

Particle Impact Noise Detection test. 
A non-destructive test to detect loose 
particles inside a device cavity. 


Per MIL-STD-883C, Method 1014.5 
Per MIL-STD-883C, Method 2002.3, 
1500 g, 0.5 ms. Condition B 
Per MIL-STD-883C, Method 2020.4 
Per MIL-STD-883C, Method 2007.1, 
20 g. Condition A 

Per MIL-STD-883C, Method 2001.2, 
20 kg. Condition D, Y1 Plane min 
Per MIL-STD-883C, Method 1014.5 
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Electrical test 

Thermal Sequence: 

Fine and gross leak 
Solder heat (optional) 
Temperature cycle 
(10 cycles minimum) 

Thermal shock 
(10 cycles minimum) 

Moisture resistance 
Fine and gross leak 
Electrical test 

Thermal/Mechanical Sequence: 

Fine and gross leak 
Temperature cycle 
(10 cycles minimum) 

Constant acceleration 

Fine and gross leak 
Electrical test 

Electrostatic discharge 
Solderability 
Solder heat 

Salt atmosphere 

Lead pull 

Lead integrity 

Electromigration 


Resistance to solvents 


To data sheet limits 


Per MIL-STD-883a Method 1014.5 
Per MIL-STD-750a Method 1014.5 
Per MIL-STD-883a Method 1010.5, 
-65 to +150X, Condition C 
Per MIL-STD-883C, Method 1011.4, 
-55 to +125X, Condition B 
Per MIL-STD-883C, Method 1004.4 
Per MIL-STD-883C, Method 1014.5 
To data sheet limits 


Per MIL-STD-883C, Method 1014.5 
Per MIL-STD-883C, Method 1010.5, 
-65 to +150'’C, Condition C 
Per MIL-STD-883C, Method 2001.2, 
30 kg, Y1 Plane 

Per MIL-STD-883C, Method 1014.5 
To data sheet limits 

Per MIL-STD-883C, Method 3015 
Per MIL-STD-883C, Method 2003.3 
Per MIL-STD-750C, Method 2031, 
10 sec 

Per MIL-STD-883C, Method 1009.4, 
Condition A, 24 hrs min 
Per MIL-STD-883C, Method 2004.4, 
Condition A 

Per MIL-STD-883C, Method 2004.4, 
Condition B1 

Accelerated stress testing of con¬ 
ductor patterns to ensure acceptable 
lifetime of power-on operation 
Per MIL-STD-883C, Method 2015.4 
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Table D>1. Microprocessor and Microcontroller Tests 


TEST 

DURATION 

SAMPLE SIZE 
PLASTIC CERAMIC 

Operating life, 125‘‘C, 5.0 V 

1000 hrs 

129 

129 

Operating life, 150X, 5.0 V 

1000 hrs 

IT 

77 

Storage life, 150X 

1000 hrs 

11 

77 

Biased 85X/85 percent RH, 5.0 V 

1000 hrs 

129 

- 

Autoclave, 121X, 1 ATM 

240 hrs 

77 

- 

Temperature cycle, -65 to 150X 

1000 cyct 

129 

129 

Temperature cycle, 0 to 125X 

3000 eye 

129 

129 

Thermal shock, -65 to 150X 

200 eye 

129 

129 

Electrostatic discharge, ±2 kV 

12 

12 

Latch-up (CMOS devices only) 


5 

5 

Mechanical sequence 


- 

38 

Thermal sequence 


- 

38 

Thermal/mechanical sequence 


- 

38 

FIND 


- 

45 

Internal water vapor 


- 

3 

Solderability 


22 

22 

Solder heat 


22 

22 

Resistance to solvents 


15 

15 

Lead integrity 


15 

15 

Lead pull 


22 

- 

Lead finish adhesion 


15 

15 

Salt atmosphere 


15 

15 

Flammability (UL94-V0) 


3 

- 

Thermal impedance 


5 

5 


*lf junction temperature does not exceed plasticity of package. 
tFor severe environments; reduced cycles for office environments. 


Table D-2 lists the TMS320C30 device, the approximate number of transis¬ 
tors, and the equivalent gates. The numbers have been determined from design 
verification runs. 


Table D-2. TMS320C30 Transistors 


DEVICE 

# TRANSISTORS 

# GATES 

CMOS: TMS320C30 

600K-700K 

200K 


Tl Qualification test updates are available upon request at no charge. Tl will 
consider performing any additional reliability test(s), if requested. For more 
information on Tl quality and reliability programs, contact the nearest Tl field 
sales office. 
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Reliability Stress Tests 


Note: 

Texas Instruments reserves the right to make changes in MOS Semicon¬ 
ductor test limits, procedures, or processing without notice. Unless prior 
arrangements for notification have been made, Tl advises ail customers to 
reverify current test and manufacturing conditions prior to relying on 
published data. 
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Marshall (503) 644-5050; Wyle (503) 640-6000. 
PENNSYLVANIA; Arrow/Kierulff (412) 856-7000, 
(215) 928-1800; GRS Electronics (215) 922-7037; 
Schweber (215) 441-0600, (412) 963-6804. 

TEXAS; Austin: Arrow/Kierulff (512) 835-4180; 
Hall-Mark (512) 258-8848; Marshall (512) 837-1991; 
Schweber (512) 339-0088; Wyle (512) 834-9957; 
Dallas: Arrow/Kierulff (214) 380-6464; 

Hall-Mark (214) 553-4300; Marshall (214) 233-5200; 
Schweber (214) 661-5010; Wyle (214) 235-9953; 
Zeus (214) 783-7010; 

Houston: Arrow/Kierulff (713) 530-4700; 

Hall-Mark (713) 781-6100; Marshall (713) 895-9200; 
Schweber (713) 784-3600; Wyle (713) 879-9953. 

UTAH: Arrow/Kierulff (801) 973-6913; 

Hall-Mark (801) 972-1008; Marshall (801) 485-1551; 
Wyle (801) 974-9953. 


WISCONSIN: Arrow/Kierulff (414) 792-0150; 
Hall-Mark (414) 797-7844; Marshall (414) 797-8400; 
Schweber (414) 784-9020. 

CANADA: Calgary: Future (403) 235-5325; 
Edmonton: Future (403) 438-2858; 

Montreal: Arrow Canada (514) 735-5511; 

Future (514) 694-7710; 

Ottawa: Arrow Canada (613) 226-6903; 

Future (613) 820-8313; 

Quebec City: Arrow Canada (418) 687-4231; 
Toronto: Arrow Canada (416) 672-7769; 

Future (416) 638-4771; 

Vancouver; Future (604) 294-1166; 

Winnipeg: Future (204) 339-0554. 

Customer 
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